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Wasted energy 


The burning off of gas during oil extraction is environmentally unsound and unjustifiable. The 
United States should instead be seeking to make use of this natural resource. 


pursuit of black gold, they simply burnt off the natural gas 

that was extracted from the rocks alongside their precious oil. 
It was a time when Earth's bounty seemed to expand without conse- 
quence in the face of human ingenuity and technological prowess. The 
incentive to invest in the infrastructure to capture the gas and bring it 
to market simply did not exist. 

But that is no longer true. There is no justification for the large- 
scale burning off, or flaring, of natural gas by today’s oil industry, 
particularly in the United States, which is home to the most mature 
and advanced oil and gas industry in the world. Nearly one-third of 
the raw gas that is pumped out of the Bakken shale formation in North 
Dakota — a prime target of the new hydrofracturing and horizontal 
drilling technologies — is burned in situ. 

True, flaring is preferable to venting gases such as methane, butane 
and propane directly into the atmosphere, butit still has a detrimental 
effect on both the global climate and the local air quality. And because 
companies are exempt from paying taxes or royalties on vented gas for 
the first year, at least, it is also bad for the public purse. Asa result, the 
public gets a smaller return on the environmental price being paid to 
recover this oil — the inevitable impacts on public infrastructure, air 
and water resources, and on the landscape itself. 

Ata conservative estimate, this North Dakota flaring meant some 
3.9 million tonnes of carbon dioxide were emitted last year, the equiva- 
lent of the annual emissions from 750,000 vehicles. Worse, research 
into flaring has begun to find evidence of potentially widespread 
methane leakage from shale operations, ifnot outright venting of the 
gas (see page 290). Methane is a powerful greenhouse gas, so the envi- 
ronmental price is likely to be even higher. 

There are solutions. In North Dakota, the state could halt the prac- 
tice of flaring except when necessary for safety reasons, or it could 
discourage companies from flaring by making them pay regular taxes 
and royalties on flared gas. This could delay the development of shale 
deposits, but that might be a good thing because it would give landown- 
ers and government agencies more time to work out how to regulate 
the environmental and social challenges that accompany energy booms 
such as shale exploitation. The companies have plenty of motivation to 
get shale resources out of the ground, and there can be little doubt that 
they would find ways to exploit the gas currently being flared, perhaps 
by exporting it. And methane emissions could be better controlled if 
the US Environmental Protection Agency regulated it as a greenhouse 
gas and instituted stricter rules across the oil and gas industry. 

The US shale boom has been a boon to the struggling economy, 
providing jobs and government revenue in many far-flung places. The 
resulting oil production has allowed the United States to reduce foreign 
imports, and the plentiful shale-gas resources have lowered demand for 
coal, thereby curbing greenhouse-gas emissions in the power sector. 
But it will be up to scientists to pin down the full suite of impacts from 


() ne could perhaps forgive the oilmen of the past. In their 


the new oil and gas developments and to help policy-makers better 
understand the choices that they are making. 

On 15 March, President Barack Obama proposed creating a 
US$2-billion Energy Security Trust to advance research and devel- 
opment into low-carbon transport alternatives. It would be funded 
over a decade by diverting a portion of the proceeds from federal oil 

and gas development, which are poised 


“The public getsa to grow thanks to the shale bonanza in 
smaller return on North Dakota and beyond. It is a good 
the environmental idea as far as it goes, although once again 
price being paid to it is hard not to despair at the general lack 


of ambition on climate issues in Wash- 
ington. Improbable as it may be, a federal 
carbon tax would raise more money and would send an important 
signal to the energy industry that it needs to control its greenhouse- 
gas emissions. 

The US economy is already benefiting from shale developments, 
and the country might even be able to lock in a one-time emissions 
reduction as part of a broader shift from coal to cleaner-burning natu- 
ral gas in the coming years. A logical part of that equation is to kill off 
the current fashion for flares. m 


recover this oil.” 


CITES for sore eyes 


Successes at last week’s wildlife-trade treaty 
meeting must be backed up with action. 


the Convention on International Trade in Endangered Species 
of Wild Fauna and Flora (CITES) last week, it became clear 
that something remarkable had happened. 

Preliminary decisions earlier in the meeting, which ran from 
3 to 14 March in Bangkok, had increased protection for a plethora 
of species and given cautious hope to zoologists and botanists. The 
final vote, by representatives of 170 of the states that are signed up to 
the treaty, rubber-stamped those decisions and sent researchers and 
conservationists out with broad smiles. 

The conference of the parties to CITES deserves praise and rec- 
ognition for placing a number of species of sharks and rays onto its 
Appendix II, which regulates trade in animals that are not deemed to 
be at immediate risk of extinction. 

CITES delegates seem to have discovered an overdue and very wel- 
come willingness to step on the toes of influential commercial interests 
that consistently oppose such restrictions, as they moved to protect 


A fter the final votes were cast at the 16th conference of parties to 
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marine species of huge commercial importance. They also saw the 
value in the wood from the trees: several species of tropical hardwoods 
were added to Appendix II. 

Key agreements on trade in elephant ivory were strengthened, to 
specify the need for campaigns to reduce demand. This is the best — and 
possibly the only — way to save elephant populations in the long term, 
say many researchers who study the illegal trade that takes poached 
ivory from Africa to markets in countries such as China and Thailand. 

There are already promising signs that public awareness of the threat 
to elephants is growing in China, as demonstrated by the campaign- 
ing of one of the country’s biggest (in more ways than one) celebrities, 
the basketball star Yao Ming. Similar campaigns against the trade in 
shark fins — whose status as a delicacy in some Asian countries is 
often blamed for declines in shark numbers — seem also to be finding 
a receptive audience. 

At the CITES meeting, there was also success for attempts to clamp 
down on the ivory trade through increased forensic scrutiny of seized 
ivory and the stockpiles of tusks kept in many African nations. This is 
a significant victory. Such DNA analysis should provide crucial infor- 
mation on the illegal trade and open up new avenues to combat it (see 
Nature 494, 411-412; 2013). It is also a validation of the hard work and 
campaigning put into this problem. The solid evidence base produced 
by the CITES projects Monitoring the Illegal Killing of Elephants and 
the Elephant Trade Information System has focused global attention 
on the resurging crisis of elephant slaughter. 

These successes carry important lessons. Popular and scientific 
reports of the threats to elephants, sharks and other species have 
helped to tip the political balance in favour of strengthening regula- 
tions. Non-governmental organizations deserve credit for raising the 


alarm, as do politicians for heeding the warnings. 

Let us not get carried away. There were also disappointments at 
CITES — notably the failure to stop trade in polar bears and their 
parts. Some conservationists also wanted even tougher moves to 
clamp down on elephant and rhino poaching, including trade sanc- 
tions, which were rejected. Overall, however, the post-meeting mood 
was jubilant — and rightly so. 

Serious questions must now be asked about the positive part that 
CITES can play in future marine conservation. If CITES wants to make 
progress in this sphere, then it must bring pressure to bear on special- 

ist fisheries-management bodies, many of 


“Delegates which have attracted criticism for allowing 
seem to have species such as tuna to be fished beyond sus- 
discovered a tainable limits. Those bodies have three years 
willingness to to put their houses in order before the next 
step onthe toes CITES meeting, or CITES will be obliged to 
of influential do it for them. (The convention does not have 
commercial a perfect record here, however — it failed in 


efforts to protect tuna populations in 2010, 
and dodged the issue at the latest meeting.) 

The decisions passed last week will not by themselves save a sin- 
gle animal or plant. Proper monitoring is essential. To build on the 
successes of the meeting, funders must provide stable financing for 
continued research on the welfare of those species that now fall under 
the protection of CITES — and those that do not. 

CITES took a great step forward last week. Its success should inspire 
all those who push for evidence-based policy. Perhaps most impor- 
tantly, it shows that international meetings that seethe with dissonant 
agendas and actors are not always toothless talking shops. m 


interests.” 


A pope for today 


Latest pontiff looks to enhance social relevance 
of Catholic Church. 


hether or not you are a believer, it is hard not to like the 
W = In the few days since the white smoke began to bil- 

low from the Sistine Chapel in Vatican City, the world has 
learned a little about Jorge Mario Bergoglio, the 76-year-old elected 
as Pope Francis I. The first pope from Latin America, as archbishop 
of Buenos Aires he eschewed the trappings of the office, forgoing a 
mansion for a small apartment, preferring to take the bus than use a 
chauffeur, and dedicating himself to pastoral work in the slums. The 
affable Pope Francis has also already wooed the public (and much of 
a fawning media) with his disarming humility and common touch 
— and his obvious flair for ad-libbing and humour. It is clear that 
Francis’s papacy marks a break with the past, a new distinctive and 
refreshing papal style, and an ambition to focus on social relevance 
and justice. “How I would like a Church which is poor and for the 
poor!” he said. 

Wealso learnt that the man obtained his first degree in chemistry, a 
later one in philosophy and another in theology, and that he has taught 
literature and psychology at universities. That broad education, aca- 
demic bent and humility are hardly a surprise because Bergoglio is the 
first Jesuit pope. The Jesuits, the largest order in the Catholic Church, 
are its intellectual elite and known for their independent thinking. 
They also vow to live lives of austerity and never to seek high office in 
the Church — let alone pope. They have focused on issues of social 
and economic injustice, and less on doctrine than do career clergy. 
They have long worked as missionaries, and are probably best known 
for their creation and running of some of the world’s top schools and 
universities. Many are also scientists. 
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We know little about Bergoglio’s views on scientific issues, which he 
has hardly written about. The hordes of scientists among the Church’s 
1.2 billion baptized members would like to hear more. And his chem- 
istry degree in itself says little about the Pope’s attitudes to science. 
But what is clear is that, contrary to widespread belief, the modern 
Catholic Church is science-friendly and Pope Francis will no doubt 
continue, and perhaps deepen, that tradition. The Church's strong 
support for Darwinian evolution, for example, contrasts sharply with 
the backwards unscientific belief in creationism of many US evan- 
gelicals and lawmakers — a concept that Pope Benedict XVI rightly 
criticized in 2007 as “absurd”. Priests also gave us Mendelian genetics 
and contributed to the theory of the Big Bang. 

Moreover, recent popes have substantially increased efforts to engage 
in dialogue with scientists on a host of issues, from embryonic stem- 
cell research and genetically modified crops to in vitro fertilization, 
abortion and euthanasia — and in the future will no doubt increasingly 
do so on advances in neuroscience and genetics, including prenatal 
screening. Scientists who have taken part in such discussions tell of 
thought-provoking and constructive debates, with the Church being 
open to ideas and often changing doctrines as a result. A damaging 
exception is its long-held opposition to the use of condoms to prevent 
the spread of HIV, and it can only be hoped that Pope Francis will have 
amore enlightened approach. 

But whereas doctrines can be tweaked, the Church will not compro- 
mise on its central dogmas, such as the sanctity of human life and that 
life begins at conception. Science and faith can provide complementary 
world views, with progress in science informing and often challenging 
the rationale of Church doctrines, and vice versa: faith can often add 
much-needed dimensions of ethics and social justice to advances in 
science and their impact on society. Clashes are inevitable between peo- 
ple of different beliefs, but both science and reli- 
gion are best served by building bridges across 
the divides. How Pope Francis responds to issues 
where the two meet will be an important mark 
of the man. = 
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WORLD VIEW jerninicor sen 


ANTED: An ambitious and skilled science administrator to 
Wow a continent-wide research programme with consid- 

erable impact and a steadily increasing budget. Diplomatic 
skills essential. Must be able to travel. 

One of the most appealing jobs in global science is now available. 

The European Research Council (ERC) is looking for a new president, 
and the European Commission, which guarantees the autonomy and 
integrity of the council, has set up an independent search committee to 
find just the right person. 

What makes the job so attractive? The answer is not too difficult. The 
ERC isa resounding success. Since its launch more than six years ago, it 
has supported more than 3,000 researchers. It is the jewel in the crown 
of European Union (EU) research. 

A decade ago, things were quite different. European scientists, many 
of whom hailed from countries that did not have 
national funding agencies, were fighting for a 
Europe-wide agency that could promote excel- 
lence in basic research, and could match what they 
saw in the United States. 

I was involved in that effort, which was univer- 
sally seen as a good idea, but difficult to get going. 
Early, cautious efforts aimed to create an ‘agency 
of the agencies. Only when the European Com- 
mission got involved did the concept begin to float. 

Because the European Commission will remain 
in the financial driving seat for the coming years, 
the most challenging part of the new job will be to 
work with the body. Commission-bashing is easy 
and fashionable, but it is usually superficial. The 
complexity of the commission's operations is daunting — to understand 
and accept that will require enormous patience, knowledge and com- 
munication skills. 

Change is never rapid, and an evolutionary rather than a revolution- 
ary type of personality may therefore be required. The commission and 
its representatives hold the ERC in high esteem, and they have made an 
impressive effort to reduce its administrative burdens. Nevertheless, 
there remains some way to go. 

The first two presidents of the ERC — Fotis Kafatos and Helga 
Nowotny — did an excellent job. And thanks to the ingenuity of the Sci- 
entific Council — the ERC’s strategy and decision-making body — and 
the strong commitment of the administrative staff, the ERC works well. 

This is not to say that the third president will be able to climb into a 
neatly made-up bed. To raise its attractiveness, the job has been rede- 
signed to be more powerful and focused than for the former presidents. 

The restructuring follows the 2011 recom- 


mendation ofa task force that investigated the DNATURE.COM 
ERC’s governance. The positions of president of Discuss this article 
the Scientific Council and of secretary general _ online at: 

have been fused to create a President sui generis, —_go.nature.com/htidbd 


THE JOB HAS BEEN 


REDESIGNED 


TO BE 
MORE 
POWERFUL 


AND FOCUSED. 


A chance to drive forward 
Europe’s science 


The new head of the European Research Council will have more power and 
focus than former presidents, says Ernst- Ludwig Winnacker. 


based in Brussels and thus able to deal directly on site with the various 
day-to-day tussles of the job. 

Still, president number three will face several challenges. One is the 
extremely uneven distribution of returns on investment to the various 
member states and associated countries. Strict adherence to the council's 
core principle of funding excellence alone has highlighted extreme dif- 
ferences in scientific competence over the European continent. More 
than 98% of the ERC’s grants are awarded to scientists in the old EU-15, 
with mere crumbs thrown to newer member countries. Although many 
of these countries have fine academic traditions, their supporting infra- 
structures tended to be neglected during the time of the iron curtain, 
and may take a generation or two to restore. The ERC cannot remedy 
this deficiency alone, but it could have an efficient advisory role. 

The ERC, like much of science, still has a gender problem. For exam- 
ple, 29% of applicants for a starting grant and 15% 
of applicants for advanced grants are women, yet 
the success rates for female scientists are consist- 
ently at least 2% lower than for men. The ERC is, 
of course, aware of this, and of the unconscious 
bias that may still be affecting peer review. The 
ERC must not only remain sensitive to the issue, 
but also find a more proactive way to tackle it. 

A particularly interesting problem is the 
evolution of the relationship between the ERC and 
the national funding councils. Some in Europe 
think that the national councils should dissolve 
and only the ERC survive. But mono-cultures are 
never ideal and can even be detrimental. 

It would be in the best interest of both the ERC 
and the national research councils to develop plans for continued coex- 
istence. A first step could be to give applicants from new member states 
access to the financial resources of other ERC nations. Bulgarian or 
Estonian scientists should be able to win money from the, say, German, 
Dutch or UK research councils, even if working in their home countries. 
Why should the rich countries participate? Because it would broaden 
the base of their own competitions and because it could rapidly improve 
the opportunities for scientists in the new member states to raise ERC 
grants, thereby reducing the unhealthy divide between East and West. 

Finally, and perhaps most importantly, the financial outlook for the 
ERC is positive. Its budget for 2014-20 will rise significantly. Not even 
our colleagues in the United States can say that. 

The ERC needs a president with a truly European mindset, and one 
who believes that it is important to build a broader base for frontier 
research. I hope our best and brightest will take note. = 


Ernst-Ludwig Winnacker was the first secretary general of the ERC 
and is now secretary general of the Human Frontier Science Program 
Organization in Strasbourg, France. 

e-mail: elwinnacker@gmail.com 
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Selections from the 
scientific literature 


RESEARCH HIGHLIGHTS 


Noisy quantum 
calculations 


Researchers have proposed 
a way to make use of noise in 
quantum computing. 
Isolating systems from 
‘noisy’ environmental 
interactions has been the 
focus of much research. Jens 
Eisert and his colleagues 
at the Free University of 
Berlin, however, propose an 
alternate framework in which 
information is encoded ina 
quantum system combined 
with its noisy environment. 
To do this, the authors 
exploit an effect called the 
cutoff phenomenon, in 
which combined systems 
suddenly change states. This 
phenomenon defines points in 
time to launch calculations or 
read out results. 
Phys. Rev. Lett. 110, 110501 
(2013) 


Blood test for 
cancer DNA 


The DNA thatis shed into the 
bloodstream from dying cancer 
cells might one day be used to 
monitor the disease. 

Carlos Caldas at the Cancer 
Research UK Cambridge 
Institute and his colleagues 
identified genetic markers to 
track disease in 30 women 
being treated for advanced 
breast cancer. They found 
that a technique to detect 
these markers in cancer-cell 
DNA in the blood was more 
sensitive than tests that look for 
circulating tumour cells or for 
acancer antigen. The amount 
of tumour DNA also roughly 
correlated with the response to 
treatment. 

The authors suggest that 
blood tests based on tumour 
DNA could be quicker and less- 
invasive than biopsies. 


Fibres toughen when stretched 


When most tough fibres are stretched to 
make them thinner, they become brittle. But a 
group led by Yuris Dzenis at the University of 
Nebraska-Lincoln has shown that this is not 


always the case. 


The researchers made polyacrylonitrile 
fibres (pictured) using a technique called 
electrospinning. As their diameters narrowed 
to below 250 nanometres, the fibres became 
tougher and so were less prone to fracture, but 


N. Engl. J. Med. http://dx.doi. 
org/10.1056/NEJMoa1213261 
(2013) 

For a longer story on this research, 
see go.nature.com/luwx3m 


Flu vaccines may 
improve with age 


A subset of white blood cells 
may determine who is best 
protected by seasonal flu 
vaccines. 

Hideki Ueno at Baylor 
Research Institute in Dallas, 
Texas, and Octavio Ramilo 
at the Nationwide Children’s 
Hospital in Columbus, 
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did not lose their strength. Nanofibres were up 
to 10-fold tougher and stronger than the best 
commercial fibres. 

Dzenis suggests that the toughening is 


possible because the nanofibres are less 


(2013) 


Ohio, and their colleagues 
injected 49 adults and 

19 children with flu vaccine. 
The researchers tracked 

cell types in individuals 
over time and discovered a 
particular class of immune 
cells that is activated on 
vaccination. These cells 

can boost production of 
existing antibodies that fight 
flu, but they do not induce 


production of new antibodies. 


This suggests that the current 
flu-vaccine strategy may 

be less effective in young 
children and against rare 
types of flu, such as H5N1. 
Sci. Transl. Med. 5, 176ra32 
(2013) 
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crystalline than larger fibres. He thinks that the 
fibres could be used in load-bearing aerospace 

structures and bulletproof materials. 

ACS Nano http://dx.doi.org/10.1021/nn400028p 


Life on the 
seabed, and below 


The dark abyss of the Mariana 
Trench in the western Pacific 
Ocean hosts a surprisingly 
active bacterial community. 
By using an automated 
deep-sea instrument that they 
designed, a team led by Ronnie 
Glud at the University of 
Southern Denmark in Odense 
measured rates of biological 
oxygen consumption in the 
almost 11-kilometre-deep 
ocean trench. The high rates 
they found there suggest that 
degradable organic matter 


DIMITRY PAPKOV/JOEL BREHM/YURIS DZENIZ 


PHILLIP STOSSEL 


is being deposited at the 
bottom of the trench, allowing 
microbes to thrive in this 
secluded habitat. 

A second team reports that 
life exists in an even more 
unlikely environment. While 
at the University of North 
Carolina at Chapel Hill, Mark 
Lever and his colleagues found 
hydrogen-loving microbes in 
the oceanic crust, deep beneath 
the seabed off the west coast of 
North America. An incubation 
experiment over several years 
showed that the organisms 
can derive energy from 
geochemical reactions between 
iron compounds and the sea 
water that permeates through 
cracks in the rocky crust. 
Nature Geosci. http://dx.doi.org/ 
10.1038/ngeo1773 (2013); 
Science 339, 1305-1308 (2013) 
For a longer story on this research, 
see go.nature.com/ppzjaz 


Protecting DNA in 
silica ‘amber’ 


High temperatures and harsh 
chemicals degrade DNA, but 
not if it is protected by a skin 
of silica. 

Robert Grass and his 
colleagues from the Swiss 
Federal Institute of Technology 
in Zurich immobilized DNA 
on charged silica particles and 
grewa 10-nanometre silica 
layer on top. The encapsulated 
DNA was variously subjected 
to strong ultraviolet radiation, 
humid 120°C heat and attack 
by reactive chemicals. Each 
time, the molecule could 
be recovered by dissolving 
the silica with a weak acid. 

The DNA remained largely 
undamaged (much as fossilized 
DNA is protected in amber) 
and its sequence could be easily 
read. The researchers used a 
195°C press to create plastic 
pills (pictured) containing 

the protected 

DNA, which 


they suggest could be added 
to consumer goods as high- 
security barcodes. 

Angew. Chem. http://dx.doi. 
org/10.1002/anie.201208135 
(2013) 


Mobile worm 
microscope 


Smart phones may offer an 
affordable, portable way to 
diagnose intestinal worm 
infection, the most common 
infection in developing 
countries. 

Isaac Bogoch at Toronto 
General Hospital in Canada 
and his group transformed an 
iPhone into a microscope by 
attaching an inexpensive lens 
to its camera. Almost 200 stool 
samples from children in 
Pemba Island, Tanzania, 
were mounted on slides and 
studied using the microscope. 
Experts identified more than 
60% of worm-free slides, and 
nearly 70% of samples with 
eggs — of these, eggs from 
the large roundworm Ascaris 
lumbricoides had the highest 
rate of detection, at 81%. 

Although similar portable 
microscopes have been made 
before, most have stayed in the 
laboratory. The authors say 
that this is the first field-based 
clinical test of such devices 
for intestinal worms, and they 
predict that higher-resolution 
imaging will boost the accuracy 
of smart-phone diagnoses. 
Am. J. Trop. Med. Hyg. http:// 
dx.doi.org/10.4269/ajtmh.12- 
0742 (2013) 


Quakes shake the 
gold out 


Earthquake after earthquake 
may build up economically 
important gold deposits by 
concentrating the precious 
metal along quartz-rich veins 
under the ground. 

Dion Weatherley of the 
University of Queensland 
in Brisbane, Australia, 
and Richard Henley at the 
Australian National University 
in Canberra modelled jogs, or 
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COMMUNITY 


CHOICE 


Mice poor models for inflammation 


€ HIGHLY READ 
on www.pnas.org 
in February 


Ina systematic evaluation of how well mice 
mimic human inflammatory responses, the 
popular disease model receives poor marks. 


A large multi-institute group co-led 
by Ronald Tompkins of Harvard Medical School in Boston, 
Massachusetts, studied genome-wide gene expression in blood 
samples from 35 healthy people and 411 people with burns, 
severe blunt trauma or sepsis. They also used public databases 
to gather human data for other acute inflammatory stresses. 
Shifts in gene expression as a result of stress were highly similar 
across the different groups of patients, but correlated poorly 
with those in a mouse model. Such disparities could lead to 
misleading results in studies of disease or in the development of 
therapies, the authors say. The quality ofa disease model should 
be assessed not just for the appearance of disease but also for its 


biomolecular fidelity, they suggest. 


Proc. Natl Acad. Sci. 110, 3507-3512 (2013) 


cracks, that cross geological 
faults. Ground motions 

can drastically increase the 
volume of such cracks, causing 
liquids trapped in them to 
depressurize rapidly and 
precipitate out their minerals, 
say the researchers. 

This process, known as flash 
vaporization, could happen 
repeatedly in geologically 
active and gold-rich areas, 
such as Indonesia and South 
Africa. 

Nature Geosci. http://dx.doi. 
org/10.1038/ngeo01759 (2013) 
For a longer story on this research, 
see go.nature.com/6jwoxn 


Frog feet share 
human hair origin 


The toe pads that allow tree 
frogs to cling to slippery 
surfaces share their origin 
with human hair. 

Toe pads are complex 
structures that are held in 
shape by a rigid network of 
proteins. Wim Vandebergh 
and his colleagues at the 
University of Brussels found 
200 gene sequences expressed 
in the toe pads of the tree 
frog Hyla cinerea (pictured), 
but nowhere else in its toe. 


The authors identified five 


keratins that seem to be the 
main structural components 
of toe pads; the genes that 
encode these five proteins 
have the same evolutionary 
origin as those for the 
keratins found in mammalian 
hair follicles. 

The authors say that these 
proteins must have arisen in 
an early tetrapod ancestor 
before diverging to become 
hair in mammals and toe 
pads in frogs. 

Biol. Lett. 9,20130051 (2013) 
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SEVEN DAYS sescnisi 


Mars find 


NASA scientists reported on 
12 March that the agency's 
Curiosity rover had found 
evidence that Mars could have 
supported life billions of years 
ago. Rock samples collected 
by Curiosity contained clay 
that probably formed in salt 
water that was of neutral pH. 
See go.nature.com/ebzeey and 
page 292 for more. 


Methane success 
Japan announced on 12 March 
that it had extracted methane 
from methane hydrate deposits 
about 300 metres under the 
seabed. Methane, the main 
constituent of natural gas, 

has previously been obtained 
from deposits under frozen 
tundra, but marine methane 
hydrate reservoirs are much 
larger. The successful effort 

by the deep-sea drilling vessel 
Chikyu marks the beginning of 
a two-week pilot project that 
will assess whether methane 
hydrates could become an 
economically feasible source of 
energy for Japan. 


GSK absorbs Sirtris 


Sirtris Pharmaceuticals, 
which is pursuing drugs that 
combat diseases related to 
ageing, is to be absorbed by 
GlaxoSmithKline (GSK), 

a pharmaceutical firm 
headquartered in London. 
GSK acquired Sirtris, based in 
Cambridge, Massachusetts, 
in 2008 for US$720 million, 
but allowed it to function as 

a distinct unit. On 12 March, 
however, GSK confirmed that 
the Cambridge site would 

be closed and the firm’s lead 
drug candidate — a molecule 
believed to activate a protein 
called sirtuin 1 — folded 

into GSK’s internal drug- 
development programme. See 
go.nature.com/nhjjtn for more. 


Plague graves unearthed in London 


A medieval plague pit has been unearthed 
during construction of a Central London rail 
project, archaeologists announced last week. 
The roughly 660-year-old mass-burial site 
in Charterhouse Square was established to 
cope with the arrival in London of the Black 
Death — the bubonic plague. Researchers 


Seabed mining 

UK Seabed Resources in 
London, a subsidiary of 
aerospace firm Lockheed 
Martin UK, has been granted 
permission to harvest metals 
from the sea floor at a site 

in the Pacific Ocean around 
1,500 kilometres southwest 
of Mexico, the company 
announced on 14 March. 
See page 294 for more. 


AstraZeneca cuts 
Drug giant AstraZeneca 
says that it will cut some 
1,600 staff — of which 

1,300 are researchers— ina 
restructuring plan that will 
see scientists concentrated 
at three major research sites: 
Cambridge, UK (which will 
become the firm's new global 
headquarters), Gaithersburg 
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the disease. 


in Maryland and Mélndal in 
Sweden. The company will 
close down all research at 
Alderley Park, UK. The plan, 
announced on 18 March, will 
cost US$1.4 billion and will see 
the firm spend an additional 
$500 million on the UK facility 
in Cambridge. 


Drug-data deceit 
A UK researcher has for 

the first time been found 
guilty of manipulating safety 
data in experiments that 
supported clinical trials, the 
United Kingdom's drug- 
regulatory agency announced 
on 12 March. Working 

for the drug-development 
firm Aptuit in Edinburgh, 
Steven Eaton had since 2003 
selectively reported figures 
that were later used to assess 
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previously obtained the genome sequence of 
Yersinia pestis, the bacterium responsible for 
the plague, from human remains uncovered in 
a separate London burial pit (see Nature 478, 
444-446; 2011). The latest find might bring 
further insight into the history and genetics of 


the concentration of a drug in 
the blood. The manipulation 
meant that failed experiments 
were deemed to have worked, 
the Medicines and Healthcare 
products Regulatory Agency 
said. He will be sentenced in 
April for violating regulations 
set out in 1999 for good 
laboratory practice. 


Energy fund 

US President Barack Obama 
on 15 March proposed the 
creation of an Energy Security 
Trust to fund research and 
development into low-carbon 
transportation alternatives. 
The trust would be supplied 
with US$2 billion over 10 years 
by diverting a fraction of the 
federal royalties from oil and 


CROSSRAIL 


gas leases, which are poised to 
grow in the coming years. See 
w page 281 for more. 

pa) 


EU budget battle 

= The level of funding for the 

§ European Union's Horizon 

= 2020 research programme 

4 is uncertain again after the 

s European Parliament voted 
° against the proposed seven- 
year general budget on 

13 March. Heads of state agreed 
at a meeting on 8 February 
on an austerity budget of 
€960 billion (US$1.2 trillion) 
for 2014-20, which includes 
about €70 billion for Horizon 
2020. The parliament, which 
last November suggested a 
research budget in excess of 
€100 billion, approved the 
spending ceiling but wanted 
unspent funds from one area 
to be spent on projects in 
another rather than returned 
to member states, as well as a 
review of the budget in 2014. 


Species protection 
Multiple species of sharks and 
rays, including the oceanic 
whitetip shark (pictured), 
were granted enhanced 
protection at a meeting of 
parties to the Convention 

on International Trade in 
Endangered Species of Wild 
Fauna and Flora (CITES) in 
Bangkok last week. Various 
species of tropical hardwood 
trees were also added to 
appendix II of CITES, which 
places restrictions on their 


RE PICT! 


growth on bats’ muzzles, 
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22 states and 5 Canadian 
provinces, and has killed an 
estimated 7 million bats. 


TREND WATCH 


White nose syndrome, a fungal 
disease of bats that is sweeping 
through North America, was 
this month documented in two 
new states: Georgia and South 
Carolina. The deadly infection, 
named for the powdery fungal 


probably came from Europe, 
where the fungus (Geomyces 
destructans) is endemic. Since 
it first appeared in New York in 
2006, the fungus has spread to 


trade. Delegates accepted 
proposals to increase the 
forensic analysis of seized 
ivory to fight poaching of 
African elephants, but rejected 
enhanced protections for polar 
bears. See go.nature.com/ 
ro76uc and page 281 for more. 


Drinks ban busted 


A judge struck down a ban 
on the sale of large servings 
of sugary drinks in New 
York city on 11 March. 
Enacted by city mayor 
Michael Bloomberg's Board 
of Health, the ban would 
have gone into effect on 

12 March. But after beverage 
makers, grocers’ groups and 
restaurateurs sued, Justice 
Milton Tingling of the State 
Supreme Court ruled against 
it, calling the ban “arbitrary 
and capricious”. Bloomberg 
said his administration would 
appeal against the ruling. 
The ban would have limited 
restaurants, cinemas and 


BAT FUNGUS SPREADING 


White nose syndrome has spread to 
22 US states and 5 Canadian provinces. 


kiosks to selling sugary drinks 
in portions of 16 US fluid 
ounces (473 millilitres) or less. 


Mars mission boost 


Russia is partnering with the 
European Space Agency (ESA) 
on its ExoMars missions to the 
red planet. Roscosmos, Russia's 
space agency, will provide the 
launch vehicles and scientific 
instruments for both a planned 
orbiter in 2016 and a rover 

in 2018, as wellas the rover’s 
descent module and surface 
platform. The two agencies 
formalized the arrangement 
on 14 March. NASA, which 

in February 2012 backed out 
of its original plan to partner 
with ESA on ExoMars, will 
contribute telecommunications 
and other support. 


PEOPLE 


Generics limbo 

The head of generic drugs 
at the US Food and Drug 
Administration, Gregory 
Geba, resigned on 15 March 
after just eight months in the 
post. He leaves the Office of 
Generic Drugs just as it is 
being reorganized and takes 
on extra responsibilities. 
Legislation last year allowed 
the office to start collecting 
fees on applications for the 
approval of generic drugs, 
in the hope that the added 
resources would allow it 

to shorten the time from 
application to approval. 
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SEVEN DAYS | THIS WEEK | 


21 MARCH 

The latest data from the 
Planck space telescope’s 
measurement of the 
cosmic microwave 
background — the faint 
afterglow from the Big 
Bang — are unveiled 

in Paris. 
go.nature.com/rz31bm 


Engineering prize 
Tim Berners-Lee is one of 
five engineers to be jointly 
awarded the inaugural 
Queen Elizabeth Prize for 
Engineering for work that led 
to the Internet and the World 
Wide Web. The winners, 
who will share £1 million 
(US$1.5 million), were 
announced on 18 March. 


| FUNDING 
Spain fund freeze 


The European Science 
Foundation (ESF) has 
temporarily frozen most 
funding for networking 

events planned beyond June 

to researchers from Spain's 
National Research Council 
(CSIC) and Ministry of 
Economy and Competitiveness 
until the agencies pay late dues. 
Spain has also cut payments 

to CERN, Europe’ particle- 
physics laboratory near 
Geneva in Switzerland, and to 
the European Space Agency 
among others. The CSIC and 
ESF say that a resolution is 
forthcoming. 


CORRECTION 

The story ‘Carbon spike’ 
(Nature 495, 146; 2013) 
wrongly stated that the 
atmospheric concentration 
of CO; has risen by 70% 
from pre-industrial levels. 
The global average of 

395 p.p.m. represents a 
41% increase. 


> NATURE.COM 
For daily news updates see: 
www.nature.com/news 
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Shale-oil Mars-rover team Will deep-sea Marathon 
boom carries extra feels pressure to pick up mining finally take the experiments, and their 
carbon cost p.290 the pace p.292 plunge? p.294 lasting rewards p.300 


‘ St at 2 


Chimp Haven in Louisiana is the final destination for retired NIH research animals. 


Time called on chimp work 


NIH likely to retire hundreds of government-owned chimpanzees. 


BY MEREDITH WADMAN 


overnment-funded chimpanzee 
(Gs in the United States has been 

on notice for more than a year. This 
month, its time is probably up. 

First came a report from the Institute of 
Medicine (IOM) in December 2011, which 
found that most chimp research is unnecessary, 
given that almost all questions can be explored 
in other animal models or in vitro. 

Now Francis Collins, director of the National 
Institutes of Health (NIH), is expected to 
accept the bulk of the recommendations from 
an NIH working group that would retire sev- 
eral hundred agency-owned chimpanzees, 
leaving a core of just 50 animals available for 


NIH research. The working group, which on 
22 January released proposals for how the 
NIH should implement the IOM report, also 
said that the agency should end six of the nine 
invasive studies that it funds. And the other 
three would be allowed to continue only if 
the animals’ environments are substantially 
improved, probably at great expense. 

Should Collins accept the recommendations, 
it would be “a milestone in animal-research 
practice and policy, and an endorsement of the 
conclusion that there is a 
rapidly decreasing need 
for the use of chimpan- 
zees in research’, says 
Jeffrey Kahn, a bioethi- 
cist at Johns Hopkins 


For more on the 
controversy over 
chimp research: 


University in Baltimore, Maryland, who led 
the 2011 IOM committee. 

The working group did not identify which 
experiments it thought should end, so NIH 
researchers will learn the fate of their studies 
only after Collins’s decision. Several chimpan- 
zee researchers contacted by Nature declined to 
speak on the record about their work, but some 
lamented that unique science would disappear. 

The nine invasive biomedical experiments 
include one that is searching for an infectious 
trigger for primary biliary cirrhosis, a dis- 
ease that slowly destroys the liver and is fatal 
without a transplant. Another, conducted by a 
biotechnology company, tests an agent that 
binds to molecular targets on immune cells 
and is intended to shut down the immune 
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> response that causes diseases such as 
rheumatoid arthritis and lupus. The candi- 
date drug’s targets occur in only two species: 
humans — in whom safety tests of the com- 
pound would be ethically impermissible — and 
chimpanzees. If the study is killed, “this line of 
investigation will probably have to stop’, says 
one observer with knowledge of the study. 

Although the IOM committee was unani- 
mous in declaring chimpanzees unnecessary 
for almost all invasive research, it was split on 
whether the animal is essential for develop- 
ing a vaccine against hepatitis C, which kills 
some 350,000 people globally each year. The 
chimpanzee is the only animal model that can 
test whether an immune response to a candi- 
date vaccine would protect against hepatitis C 
infection. One of the nine studies now in jeop- 
ardy explores why the immune response in 
chimpanzees often cannot clear hepatitis C 
infections. 

“Everybody working with chimpanzees in 
the hepatitis C field realizes that they are really 
unique animals,” says Stanley Lemon, who 
studies hepatitis C at the University of North 
Carolina, Chapel Hill, and who resigned from 
the NIH working group last year. “But you have 
to balance that against the weight of human 
morbidity and mortality” 

The working group also recommended that 
five of 13 behavioural and comparative-genom- 
ics studies should end. The Yerkes National 
Primate Research Center in Atlanta, Georgia, 
hosts some of these studies, which include an 
assessment of the roles of the hormones vaso- 
pressin and oxytocin in social cognition, anda 
look at how genes and early social experiences 
affect brain development and behaviour. 

Even if Collins allows the Yerkes studies to 
continue, the recommendation for new hous- 
ing requirements could doom the projects. 
The working group proposed the provision of 
6.1 vertical metres of climbing structure and 
93 square metres of space for each animal; fresh 
materials for daily construction of nests; forag- 
ing opportunities; and housing in social groups 
of at least seven. “We estimate it could cost tens 
of millions of dollars for our centre alone to 
renovate space that will have limited use given 
our ageing colony,’ says Yerkes spokeswoman 
Lisa Newbern. 

Some ethicists say that if Collins embraces 
the recommendations, his decision will not only 
mark the end of most government-supported 
chimpanzee research, but also raise questions 
about work on other non-human primates. The 
NIH supports a network of primate research 
centres that houses some 27,000 non-human 
primates of 20 species. “We're not going to 
suddenly decide that all primates have got to 
be retired,” says Gregory Kaebnick, a research 
scholar at the Hastings Center in Garrison, New 
York, who last month led a study on changing 
ethical views on animal research. “But we might 
find ourselves trying to reassess the way we've 
been using them? = 


290 | NATURE | VOL 495 | 21 MARCH 201 


3 


Natural gas from oil wells in North Dakota is often burned rather than captured and sold. 


Oil boom raises 
burning issues 


Unburned methane could be adding to the environmental 
impact of gas flares in North Dakota. 


BY JEFF TOLLEFSON 


hen Paul Shepson flew his twin- 
propeller Beech aircraft over 
the Williston Basin in North 


Dakota last June, the perverse economics of 
a modern-day energy bonanza were on vivid 
display. Shepson, an atmospheric chemist at 
Purdue University in West Lafayette, Indiana, 
saw 10-metre-high flames writhing in the 
wind as shale-oil producers burned off the 
natural gas that was coming up along with 
the oil — a valuable resource that here, far 
from gas pipelines, is just a nuisance. 

In 2011, North Dakota's oil producers 
flared around 1.4 billion cubic metres of natu- 
ral gas, according to the Energy Information 
Administration (EIA) in Washington DC. 
That was nearly 32% of the total amount of 
gas produced by the state, and enough to 
heat around 700,000 houses. And the flaring 
increased in 2012. Scientists are just begin- 
ning to assess how the carbon dioxide from 
the flames adds to the climate footprint of the 
oil, and how black carbon and other pollut- 
ants affect local air quality. 
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But Shepson has also begun to detect 
evidence of a hidden threat: plumes of pure 
methane — a much more potent greenhouse 
gas than CO, — that is either leaking from 
the oil and gas infrastructure or being vented 
by extractors who are not flaring. Although 
he has yet to quantify the methane emis- 
sions, Shepson says that the energy coming 
out of the basin could have a steeper envi- 
ronmental price than many realize. 

Over the past several years, companies have 
been using the same hydrofracturing technol- 
ogies that have spurred a broader US gas rush 
to drill thousands of oil wells into the Bakken 
shale formation, which stretches across North 
Dakota, Montana and the Canadian province 
of Saskatchewan. High oil prices are driving 
the boom, which some predict could help to 
make the United States the world’s largest 
oil producer by 2020. Companies that build 
pipelines and gas-processing plants simply 
cannot keep up. 

Some 65% of the oil from the Bakken shale 
is now being shipped out by rail. But the raw 
natural gas that bubbles up along with it — a 
mixture of lighter liquid hydrocarbons and 
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gases such as propane, butane and methane— 
requires both pipelines and processing, says 
Rusty Braziel of RBN Energy, a consultancy 
based in Houston, Texas. Some of the liquid 
gas is shipped to Canada, where it is used 
to dilute raw bitumen from the Alberta tar 
sands for shipment back to the United States, 
but production has so far outpaced demand. 
“The companies would love to sell that natural 
gas,” says Rich Alonso, a partner at Bracewell 
& Giuliani, an industry lobbying firm based in 
Washington DC. But “it has got no place to go”. 


BURNING PROBLEM 
The upshot is flaring, and lots of it. North 
Dakota’s flaring rose more than sixfold 
between 2006 and 2011, and total US flar- 
ing increased by 62% over the same period, 
according to the EIA. Satellite data from 
researchers at the National Oceanic and 
Atmospheric Administration in Boulder, 
Colorado, show that even before the shale-oil 
boom, the United States was flaring more gas 
than all but a handful of countries, including 
Russia, Nigeria, Iran and Iraq. 
Environmentalists are pushing North 
Dakota to halt flaring and require that the gas 
be used to produce electricity or be processed 
into usable products, such as fertilizer. Scien- 
tists and entrepreneurs are working on mobile 
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The proportion of the natural gas being vented or flared during shale-oil operations in North Dakota hovers at 
around one-third, but carbon dioxide emissions have grown with booming production. 


w 
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refineries that could be deployed in the field 
to process the stranded gas. And at the fed- 
eral level, environmentalists are asking the US 
Environmental Protection Agency to regulate 
methane emissions under the Clean Air Act. 
But there is no obvious resolution in sight. 
“It’s being treated as an infrastructure 
problem, but I see it first and foremost as a 
regulatory problem,” says Brad Crabtree, pol- 
icy director for the Great Plains Institute, an 
environmental advocacy organization based 
in Minneapolis, Minnesota. “The industry can 
afford to manage the gas and should be required 
to do so? Industry officials say that gas infra- 
structure is already expanding and argue that 
regulations would only slow that development. 
Shepson initially focused his investigations 
on the flares, hoping to measure the pollutants 
they emit and how much methane was escap- 
ing unburned. He and his crew flew through 
the plumes rising from 11 major flares, and 
found that they were highly efficient, burn- 
ing all but around 0.1% of the methane. But 
the team also documented numerous plumes 
of pure methane, suggesting that natural gas 
is escaping directly into the atmosphere in 
many locations. This is particularly troubling 
because, molecule for molecule, methane has 
roughly 25 times the warming power of CO). 
Exactly how much methane is escaping 
remains unclear, however, as does its source, 
although some suspect that flaring is not as uni- 
versal a practice as believed. “We don’t have an 
estimate for how leaky the Bakken is in terms of 
methane emissions, but if you are going to flare 
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the gas anyway you probably aren't going to be 
too careful, says David McCabe, an atmos- 
pheric scientist at the Clean Air Task Force, an 
environmental group based in Boston, Massa- 
chusetts, which funded the work. 

Scientists are just beginning to incorporate 
this information into life-cycle greenhouse-gas 
assessments of shale-oil developments. The oil 
coming out of the Bakken formation is high- 
quality light crude, which produces fewer CO, 
emissions than thick oil from the Canadian tar 
sands or Venezuela. But no one knows for sure 
how natural-gas leakage and flaring affect that 

equation. Assuming a 


“Ifyouare going best-case scenario in 
to flare the gas which all the meth- 
anyway, you ane is combusted, 
probably aren’t McCabe estimates 
going to be too that the flares would 
careful.” have generated nearly 


4 million tonnes in 
CO, emissions over the course of 2012 (see “Up 
in smoke’), akin to running a medium-sized 
coal-fired power plant in the prairie. The leak- 
ing methane could add substantially to that 
climate impact. 

Shepson says he hopes to get a clearer 
reading of the methane leakage during a sec- 
ond round of investigations this year, but he is 
worried about the big picture. “When you fly 
over this region, what hits you is the scale of the 
development,” he says. “We are reinvesting in 
an aggressive way in fossil fuels, and the price 
of all of that will be a more certain and larger 
impact from climate change.” mSEEEDITORIALP.281 
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PLANETARY SCIENCE 


Mars rover under pressure 
to reach mountain goal 


Curiosity’s memory glitch prompts mission scientists to pick up the pace. 


BY ALEXANDRA WITZE 


even months after it hurtled to a landing 

on Mars, NASA's Curiosity rover has 

rolled just 738 metres across the surface. 
It remains about 10 kilometres from its pri- 
mary goal: Aeolis Mons, a 5-kilometre-high 
mountain thought to contain layers formed in 
ancient ponds. With the mission nearly one- 
third of the way through its allotted two years, 
the science team is beginning to feel the pres- 
sure to get moving. 

If Curiosity were to die tomorrow, “we could 
certainly say we achieved a lot’, says science- 
team member Laurie Leshin, a planetary geo- 
chemist at Rensselaer Polytechnic Institute 
in Troy, New York. “But none of us would be 
satisfied” 

It took two months just to unfurl the array 
of scientific instruments on the US$2.5-billion 
Curiosity — the biggest, most expensive and 
most lavishly equipped rover ever sent to 
Mars. It is also planetary exploration’s most 
complicated robot ever, with about 400 scien- 
tists working on ten different instruments: a 
social environment that can slow things down. 
But the first serious glitch of the mission — a 
memory malfunction on 28 February that 
forced operators to use the rover’s back- 
up computer — has reminded Curiosity’s 
handlers that the machine is mortal. 

From the start, however, mission scientists 
have found plenty of intriguing distractions. 
Soon after touching down in Gale Crater on 
6 August last year, the rover found evidence of 
a dried-up stream bed near the base of Peace 
Vallis, an 80-square-kilometre fan of sediments 
that washed down from the walls of the crater. 

In October, after heading to Glenelg, a junc- 
tion where three terrains come together, the 
rover stopped for almost two months at a drift 
of sand dubbed Rocknest. Here, Curiosity 
plunged its sampling scoop into the sand and 
tipped the material into its two main analyti- 
cal instruments, CheMin (for Chemistry and 
Mineralogy) and SAM (Sample Analysis at 
Mars). On 19 March, an entire session of the 
Lunar and Planetary Sci- 


ence Conference atthe DNATURE.COM 
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to the findings at Rock- _ Nature's special: 
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out to be chemically similar to soils analysed 
elsewhere on Mars, suggesting that materi- 
als such as water move between the soil and 
atmosphere in similar ways around the planet. 

Also at the conference, scientist Chris 
Webster of NASAs Jet Propulsion Laboratory 
in Pasadena, California, reported the latest on 
methane measurements taken by the rover’s 
atmosphere-sniffing, tunable laser spectrom- 
eter. The spectrometer team’s findings have 
lowered the upper limit for methane in Mars’s 
atmosphere — to as little as 3 parts per bil- 
lion by volume. The scarcity of methane casts 
doubt on the possibility that Martian microbes 
are emitting the gas into the atmosphere (see 
Nature 491, 174; 2012). The instrument has 
been used just three times in the first 200 sols 
(Martian days), even though it can be activated 
at night when there is less demand from other 
science instruments on the rover. This is a sign 
of the intense competition for rover time: one 
of the reasons for the slow pace. 

Since 23 January, the rover has more or 
less stayed in one place, snuggled up next to a 
reddish rock nicknamed John Klein, ina region 
called Yellowknife Bay. At John Klein, Curiosity 
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used its drill for the first time, and analysis of 
the powdered drilling sample revealed clay 
minerals that formed in a watery environment, 
along with oxidized and reduced forms of sul- 
phur. That suggests that Yellowknife Bay was 
once a benign environment for microbial life 
(see Nature http://doi.org/ktt; 2013). Yellow- 
knife “has been a great place to start’; says team 
member Roger Wiens of Los Alamos National 
Laboratory in New Mexico. “We wouldnt have 
wanted to miss this for anything” 

But Jorge Vago, project scientist for the 
European Space Agency’s ExoMars mission, 
says that he is already thinking about how he 
will need to do things faster with his rover, 
which is scheduled for launch in 2018. “Clearly, 
we couldn't afford to run ExoMars in the way 
[Curiosity] is doing the mission,’ he says. 
Instead, ExoMars is likely to have a shorter 
mission length, so might need to perform 
its science at a pace somewhere between the 
tortoise speed of Curiosity and the hare-like 
rate of NASAs Spirit and Opportunity rovers, 
which landed on Mars in 2004 with primary 
missions of a little over three months. 

Just as Curiosity’s science team is getting 
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itchy to move the rover again, however, it will 
have to take another long break. For most of 
April, Mars will be behind the Sun as seen 
from Earth, and no spacecraft on or around 
Mars will be able to radio home. After Mars 
emerges, the team aims to drill into the John 


Klein rock once more, to repeat the measure- 
ments. Then the rover will finally hit the road 
to Aeolis Mons, also known as Mount Sharp. 
Curiosity will follow a seemingly flat, unob- 
structed path around a set of sand dunes (see ‘A 
long road’) at speeds of up to 100 metres a day, 
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says Raymond Arvidson, a planetary geologist 
at Washington University in St Louis, Missouri. 
“T call it the rapid transit route,” says Arvidson. 
“There's general agreement that we have to get 
to the base of Mount Sharp before the primary 
mission ends, with a lot of time to spare. = 
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Gene-analysis firms 
reach for the cloud 


Online bioinformatics companies rush to provide genomics 
platforms and software for hospitals. 


BY ERIKA CHECK HAYDEN 


ish Genetic Disorders in New York, DNA 
sequencing is the easy part. It costs less 
than US$1,500 per person to have the impor- 
tant parts of his clients’ genomes sequenced. But 
it would be dauntingly expensive to maintain 
servers and staff to analyse the data and iden- 
tify mutations that might be causing the undi- 
agnosed diseases that afflict his clients’ families. 
So Jalas, the centre’s director of genetics 
resources and services, has outsourced parts of 
the analysis. He uploads his clients’ sequencing 
data to cloud-computing software platforms, 
where he can run analyses without having to 
set up the infrastructure in-house. The cost is 
about $100 per person. And the cloud-based 
interfaces let him collaborate with doctors in 
Israel without worrying about repeatedly trans- 
ferring data on slow Internet connections. “For 
me, it is convenient and cost-efficient; he says. 
Jalas and the way he works represent a 
new and mostly untapped market for a new 
crop of genetics interpretation and analysis 
firms, which will be touting for customers at 
a meeting of the American College of Medical 
Genetics and Genomics in Phoenix, Arizona, 
on 19-23 March. Dozens of these firms have 
emerged, some in the past year, as ever more 
affordable sequencing moves from academia 
into the clinic (see Nature 494, 290-291; 2013). 
Doctors will increasingly want to use sequen- 
cing data to guide decisions about patient care, 
but might not necessarily want to invest in staff 
and software to make sense of those data. 
“It’s a huge unmet need,” says David Ferreiro, 
a biotechnology analyst with investment bank 
Oppenheimer & Company in New York, which 
invests in many sequencing-technology and 
-analysis companies. 
Where there is a need, there is also money to 
be made. The companies, many of them based 


FE or Chaim Jalas at the Center for Rare Jew- 


in California's Silicon Valley, have been tempted 
bya market in outsourced sequencing and anal- 
ysis software that by 2016 could top $4 billion 
per year, according to BCC Research, a market- 
research company in Wellesley, Massachusetts 
(See ‘Genes in the marketplace’). “The sky’s the 
limit, says Andreas Sundquist, chief executive 
of DNAnexus in Mountain View, California, 
which provides genetic-analysis software on 
its cloud-based platform and allows users to 
upload and run their own algorithms. 

Other firms offer a range of approaches. 
Seven Bridges Genomics, based in Cambridge, 
Massachusetts, aims to be accessible to people 
with no expertise in bioinformatics, and pro- 
vides access to free tools for designing custom- 
made analysis pipelines. Ingenuity Systems 
in Redwood City, California, allows users to 
upload a list of mutations in a person's genome, 
and finds those most likely to cause disease. 
Personalis, down the road in Menlo Park, offers 
sequencing services and interpretation for cli- 
nicians and pharmaceutical and biotechnol- 
ogy companies. Last week, the company won a 
$1.53-million contract with the US Department 
of Veterans Affairs to look for genetic variants 


GENES IN THE MARKETPLACE 


The market for outsourced gene-sequencing work 
and custom-made software is projected to grow 
more quickly than that for sequencing machines. 
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in samples from as many as one million military 
veterans, to explore the variants’ roles in disease. 
(The company will outsource the sequencing to 
Illumina, a market leader in sequencing tech- 
nology based in San Diego, California.) 

The activity is reminiscent of that a decade 
ago, when bioinformaticians started up a flurry 
of companies, most of which were unsuccessful 
because the path from a genetic-disease marker 
to a profitable drug has not been straight- 
forward. Today’s companies have moved on to 
other challenges, says Steven Brenner, a com- 
putational genomicist at the University of Cali- 
fornia, Berkeley. He says that they will have to 
prove that their products are better than freely 
available software — and do so without disclos- 
ing their intellectual property. “These compa- 
nies havea tricky situation,” he says. 

And they are facing competition from the 
sequencing-technology firms themselves. 
Last year, [lumina opened BaseSpace Apps, 
a marketplace for online analysis tools to be 
used on data uploaded to the company’s own 
cloud-computing platform. Information-tech- 
nology companies are getting in on the action, 
too: Oracle, based in Redwood Shores, Cali- 
fornia, has its own products aimed at helping 
researchers and hospitals to analyse data. 

But one of the biggest questions will be 
how deeply analysis companies can reach into 
medical settings, where privacy concerns are 
paramount. Hospitals can be fined if patient 
privacy is compromised, and clinical geneti- 
cists may be uneasy about uploading data to 
the cloud. “It’s your licence and your lab that go 
on the line when it comes to reporting a clini- 
cal result,’ says Elizabeth Worthey, director of 
genomic informatics at the Human and Molec- 
ular Genetics Center at the Medical College of 
Wisconsin in Milwaukee. That is a large part of 
why many hospitals have so far chosen to build 
their own analysis infrastructure, says Ferreiro. 

To address privacy concerns, Bina Tech- 
nologies in Redwood City sells a server that 
can sit in a customer’s own data centre and 
is optimized to run genome-analysis soft- 
ware. Knome of Cambridge, Massachu- 
setts, announced last year that it plans to sell 
$125,000 genome-analysis machines for use 
in customers labs (see Nature 490, 157; 2012). 

It seems unlikely that any single analysis 
company will rule the market; the range of cus- 
tomers who need to interpret sequence data is 
growing, and each has their own needs. “We 
would rather see a thousand flowers bloom,” 
says Sundquist. “I don't think we're going to 
see consolidation anytime soon.” m 
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WEALTH FROM THE SEA 


Systems are being developed to harvest mineral-rich nodules from the sea floor. In this scheme, the harvester 
would vacuum or rake up the nodules, grind them and mix them with sea water. The slurry would be piped to 


ships and then processed onshore. 


Nodules ——e 


UK company pursues 
deep-sea bonanza 


After decades of preparation, firm moves forward with 
plans to harvest mineral-rich nuggets from sea floor. 


BY MARK SCHROPE 


bizarre episode from the cold war 
Az foreshadow a new era of seabed 

mining. In 1974, the US government 
used a ship owned by eccentric billionaire 
Howard Hughes to pick up the remains of a 
Russian nuclear-missile-bearing submarine 
that sank in more than 5,000 metres of water. 
As cover, the United States claimed that 
the expedition was collecting polymetallic 
nodules: rocky lumps that litter the deep sea 
floor, rich in metals such as manganese, nickel, 
copper, cobalt and rare-earth elements. 

It turns out that Lockheed, the US defence 
contractor that made the equipment used to 
grab the sub, had a real interest in the miner- 
als. And last week, its London-based affiliate 
Lockheed Martin UK announced that it is to 
explore the harvest of nodules in the Pacific 
Ocean, 1,500 kilometres southwest of Mexico. 

If it were to go ahead, the mining would be 
the culmination of decades of technological 
effort and exploration. “Lockheed Martin (US) 
in the end did more development and at-sea 
testing than any other company,’ says Caitlyn 


Antrim, executive director of the Rule of Law 
Committee for the Oceans in Washington DC, 
and a deep-seabed-mining consultant. The 
venture could add £40 billion (US$60 billion) 
to the UK economy over the next 30 years, said 
UK Prime Minister David Cameron last week. 
Cameron’s government sponsored Lock- 
heed’s claim to the 58,000-square-kilometre 
potential mining site through the company’s 
subsidiary UK Seabed Resources in London. 
The International Seabed Authority in King- 
ston, Jamaica, which granted the exploration 
rights, has also granted claims in the region 
to several other countries, but work towards 
harvesting has been slow. The UK effort has 
advantages, says Antrim: Lockheed has proven 
technologies and the most nodule-bed data. 
Polymetallic nodules form over thousands 
of years on the sea floor, through processes 
that are still not fully understood; most nod- 
ules range in size from the diameter of a golf 
ball to that ofa large potato. Between the 1960s 
and the 1980s, various companies explored the 
possibility of harvesting them — but as metal 
prices fell, the prospect became economically 
unfeasible. In recent years, however, climbing 
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prices and advances in deep-sea equipment 
such as remotely operated vehicles and flex- 
ible risers — pipes used to lift material — have 
revived interest. “It’s part of the art of the possi- 
ble now,’ says Miles Libbey, a former employee 
of Lockheed Martin US, and the founder of 
Seaminr, a seabed-mining consultancy in 
Urbanna, Virginia. 

Harvesting nodules is technically easier 
than mining the mineral-rich remnants of 
non-active hydrothermal vents on the sea 
floor, a proposal that has also garnered atten- 
tion (see Nature 447, 246-247; 2007). Nodules 
rest unattached, and often visible, in sea-floor 
sediments, so obtaining them would not entail 
grinding up rocky sea-floor crust. Lockheed 
has not released specific plans, but the gen- 
eral scheme for harvesting uses a device that 
moves along the sea floor, vacuuming or rak- 
ing up nodules. The equipment would grind 
down the nodules to create a slurry that would 
be pumped to ships and then transported 
onshore for processing (see “Wealth from 
the sea). 

The most lucrative product would probably 
be nickel, which makes up only about 1% of 
the nodules but is used in everything from 
stainless steel to rechargeable batteries. Rare- 
earth elements are much less concentrated at 
around 0.1%, but will still be valuable: they are 
crucial for technologies in the aerospace and 
alternative-energy industries. China currently 
controls the vast majority of supplies. 

Philip Rood, a spokesman for Lockheed Mar- 
tin UK, says that it hopes operations can begin 
in 5-6 years, once Lockheed has completed 
technology tests and environmental studies. 

Data are so far sparse on the degree to which 
the operations would threaten deep-sea life 
such as sediment-dwelling sea cucumbers, 
worms and small crustaceans, or creatures that 
live higher in the water column, such as fish. 
A study at a nodule plain off Peru found that 
sea-floor communities showed limited recov- 
ery after disturbance, with sediment dwellers 
gradually returning over 7 years (H. Bluhm 
Deep-Sea Res. II 48, 3841-3868; 2001). 

Craig Smith, a deep-sea biologist at the 
University of Hawaii at Manoa, will lead an 
initial assessment of sea-floor life for Lock- 
heed’s project, gathering baseline data for the 
potential harvest zone and comparison areas. 
“T think this is going to happen,” he says of 
nodule harvesting. “To manage the environ- 
mental impacts well we really need excellent 
information.” 

Cindy Van Dover, director of the Duke 
University Marine Laboratory in Beaufort, 
North Carolina, has worked on assessments 
of deep-sea mining impacts. She says, “I don’t 
think there are red flags but I do think there 
are yellow flags.” 

Her main concern is one of scale. “One tiny 
patch is not going to make any difference but 
what I can't figure in my head is what happens 
when that’s multiplied.” m 
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Text-mining spat heats up 


Scientists and publishers clash over licences that would let machines read research papers. 


BY RICHARD VAN NOORDEN 


research — if only the gatekeepers 

would let scientists in. Researchers have 
complained bitterly over the past year that 
publishers do not allow them to use computer 
programs to download and crawl across the 
text of research articles, a methodology known 
as text mining that can reveal large-scale pat- 
terns in the studies (see “Uses of text mining’). 

Fearful that their content might be freely 
redistributed, publishers tend to block programs 
that they find crawling the full text of articles, 
making no exceptions for users who have paid 
for access. They give permission only on a case- 
by-case basis to those who negotiate agreements 
on access and use. Now, the European Commis- 
sion (EC) and publishers’ consortia are trying to 
craft clearer rules. But complaints last month 
to an EC group set up to discuss text and data 
mining suggest that disagreements are still rife. 

“Data- and text-mining techniques... could 
hold the key to the next medical breakthrough, 
if only we freed them from their current legal 
tangle,” Neelie Kroes, vice-president of the 
European Commission, told a Brussels intel- 
lectual-property summit last September. 

Publishers say that so far, few research- 
ers are asking permission to mine text. Still, 
Amsterdam-based publisher Elsevier says that 
computer robots crawling its ScienceDirect site 
made up 4% of total web traffic on the platform 
in 2012, almost twice the level in 2011. What- 
ever the intention of such programs, the figure 
suggests that machines, not just humans, are 
increasingly poring over articles. 

Raul Rodriguez-Esteban, a computational 
biologist at drug company Boehringer Ingel- 
heim in Ridgefield, Connecticut, says that he 
ran 160 text-mining queries in 2012. In one, 
he searched more than 23,000 articles to pick 
out hundreds of proteins that could relieve a 
mouse model of multiple sclerosis. He then 
sketched a network of other proteins that inter- 
acted with them, and found new potential drug 
targets. Academic researchers covet this capa- 
bility, but say that it takes months or years to 
negotiate agreements. It took Max Haeussler at 
the University of Santa Cruz, California, three 
years to get the rights to download 3 million 
articles, from which he extracts DNA data to 
annotate an online map of the human genome 
(see Nature 483, 134-135; 2012). 

Later this year, the United Kingdom will 
make text mining for non-commercial pur- 
poses exempt from copyright, potentially 


lE is seen as the future of computer-based 
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Uses of text mining 


Linking genes to research papers. 
The text2genome project pulls out 
DNA sequences from around 3 million 
research papers to produce an online 
genome map in which each region is 
linked to relevant articles. 
go.nature.com/iupijx 


Mapping the brain. The NeuroSynth 
project extracted brain-scan data from 
almost 4,400 research articles, allowing 
users to link locations in the human 
brain with associated research terms 
and topics. neurosynth.org 


Chemistry data. SureChem (owned by 
the same parent company as /Vature) 
harvests and makes freely available data 
on molecules from some 20 million 
patents. surechem.com 


Drug discovery. Researchers searched 
free abstracts from more than 20 million 
articles in the MEDLINE database, and 
discovered an indirect link between 
E-cadherin (a cell-adhesion molecule) 
and Parkinson’s disease. go.nature.com/fsv4az 


allowing scientists to mine any content they 
have paid for. Text miners want the EC to do 
the same. “The slogan that is doing the rounds 
among users currently is: ‘the right to read is 
the right to mine,” says John McNaught, dep- 
uty director of the National Centre for Text 
Mining at the University of Manchester, UK. 
But the EC’s working group to discuss text 
and data mining, set up this year, has already 
run into controversy. After a meeting on 4 Feb- 
ruary, researchers and librarians complained 
that the group was discussing only how to work 
with text-mining licences, not how to exempt 
text mining from copyright. “This will only 
raise barriers to the adoption of this technology 
and make computer-based research in many 
instances impossible,” they wrote on 26 Febru- 
ary ina letter to Kroes and three other European 
commissioners, who have not yet responded. 
A second meeting on 8 March provided little 
assurance that exceptions would be seriously 
considered, says Ross Mounce of the Univer- 
sity of Bath, UK, who is using text mining to 


extract trees of evolutionary relationships from 
the literature. The EC working group hopes to 
reach a conclusion by the end of the year. 

The situation is better in the United States, 
where some lawyers think that text mining 
might be permitted by ‘fair use’ rights, which 
allow snippets of text to be freely copied. But no 
one knows for sure, and many researchers are 
wary of testing the bounds of this legal grey area. 

Some publishers say that unrestricted text 
mining could strain their servers, and so agree- 
ments will always be needed to specify when 
and how articles may be downloaded. Cross- 
Ref, anon-profit collaboration of thousands of 
scholarly publishers, is developing a system to 
let researchers agree to standard text-mining 
terms by clicking a button on a publisher’s 
website. CrossRef’s Geoff Bilder hopes that 
the system will roll out by the end of the year. 

The Copyright Clearance Center (CCC) 
in Danvers, Massachusetts, which works 
with publishers on rights licensing, is pursu- 
ing a more ambitious effort. It would act as 
an intermediary, collecting publishers’ terms 
and content and storing them on a website for 
researchers, says the CCC’s Roy Kaufman. It is 
working with six publishers (including Nature 
Publishing Group) and with drug and chemi- 
cal firms eager to mine the literature. 

Heather Piwowar of the National Evolution- 
ary Synthesis Center in Durham, North Caro- 
lina, who studies how researchers use data, says 
that it is unfair that large firms such as Google 
are allowed to crawl across content to index 
it — yet scientists are restricted. “Is this defen- 
sible on the grounds that Google knows what 
it is doing but The Rest Of Us Can Not Be 
Trusted?,’ she blogs. “I sure hope not.” = 


CORRECTION 

The News story ‘DNA tool kit goes live 
online’ (Nature 495, 150-151; 2013) 
wrongly located Randy Rettberg at the 
Massachusetts Institute of Technology. He 
is at the non-profit organization the iGEM 
Foundation in Cambridge, Massachusetts. 
The News story ‘Sticky problem snares 
wonder material’ (Nature 495, 152-153; 
2013) said that the 2010 Nobel prize was 
awarded for graphene’s discovery, but it 
was for experiments involving graphene. 
And the World View ‘The unlikely wisdom 
of Chairman Mao’ (Nature 495, 143; 2013) 
wrongly stated that Mao Zedong was 
China’s premier. 
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STAR TRACKER 


As an early adopter of astronomical technology, Andrea Ghez is 
revealing secrets about the giant black hole at the Galaxy's centre. 


BY ANN FINKBEINER 


he technology was a complete joy, says Andrea Ghez, think- 
ing back to the mid- 1980s and her first time helping out at an 
Observatory. She wanted to learn everything. “How to open 
the dome! How to fill the instrument with liquid nitrogen! 
Develop the plates! Reduce the data! Coding!” 

And then there was the science. Ghez did not know muchat the start; 
she was majoring in physics at the Massachusetts Institute of Technology 
(MIT) in Cambridge, working for an astronomer as her undergraduate 
research experience. But as she learned more about his research into unu- 
sual cosmic sources of X-rays, Ghez became enthralled by the thought 
that some of those sources might be black holes — singular points with 
a gravitational pull so strong that not even light can escape them. “It got 
me completely fascinated by black holes,” she says. By the time she had 
spent two undergraduate summers working at telescopes in Arizona 
and Chile, Ghez was hooked. “I fell in love with the whole profession” 

Now an astronomer at the University of California, Los Angeles, she 
still feels the same. Her fascination with black holes has led her into a 
pioneering, decades-long study that has proved the existence of the big- 
gest black hole in our cosmic neighbourhood: the 4.1-million-solar-mass 
behemoth that lies at the centre of the Milky Way’” (see ‘The monster in 
the middle’). This work earned her a MacArthur ‘genius’ award in 2008, 
and half of the Crafoord prize, astronomy’s Nobel, in 2012. 

Ghez’s love of technology helps to explain why her quest has been so 
fruitful. Most astronomers use only the tools they know, but Ghez is an 
enthusiastic early adopter — first in line to try out cutting-edge detec- 
tors and optical techniques that are barely out of the laboratory. “T like 
the risk of a new technology,’ she says. Maybe it won't work. But maybe 
it will open a fresh window on the Universe, answering “questions you 
didn’t even know to ask’; she says. “Any time you look, you're astounded!” 

Reinhard Genzel, a director of the Max Planck Institute for 
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Extraterrestrial Physics in Garching, Germany — the co-winner of the 
2012 Crafoord prize and Ghez’s sharpest competitor on the Galactic 
Centre work — puts it very simply. “Andrea,” he says, “is one of a rare 
adventurous class.” 


DEEP FOCUS 

Ghez’s devotion to her work would make her seem fierce — if she weren't 
always smiling, and her sentences didn’t keep exploding into verbal capi- 
tals. As it is, with her barely controlled curls, straight-across eyebrows 
and direct gaze, she conveys a cheerful intensity. She doesn't digress when 
she talks; she focuses. And she has always had a certain determination. 

According to Ghez family legend, when 4-year-old Andrea watched 
the first Apollo Moon landing with her parents in Chicago, Illinois, on 
20 July 1969, she announced that she, too, was going to the Moon as 
an astronaut. True, she also wanted to be a ballerina. But while attend- 
ing the progressive University of Chicago Laboratory Schools, she says, 
she became “really clear” that she loved mathematics and science. That 
passion took her to MIT in 1983 and then, after her epiphany in the 
observatory domes, to the California Institute of Technology (Caltech) 
in Pasadena for graduate studies in astronomy. 

Caltech, Ghez explains, “had the best toys by far”. Among them was 
the 5-metre Hale Telescope, then one of the world’s largest, on Califor- 
nia’s Palomar Mountain. But the toy that particularly captured Ghez’s 
interest was an experimental speckle imager, an instrument intended to 
get around astronomers eternal problem with air. Earth’s atmosphere is 
transparent but turbulent — a collection of bub- 


bling ‘cells’ that are warmer here, cooler there,and NATURE.COM 
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THE MONSTER IN THE MIDDLE 


By tracking stars near the mysterious object at the 
centre of the Milky Way, astronomers have shown that 
they move in years-long orbits; 8 examples are shown 
here. These orbits prove that the object packs the 
mass of 4.1 million Suns into a space smaller than 
the Solar System, and can only be a black hole. 


THE BLACK HOLE 


MARK GARLICK 


flickers, dances and fragments, smearing the 
point-like image of each star into a fuzzy ball. 

Speckle imaging freezes the dancing images 
in place with a camera that captures very short 
exposures every few milliseconds, taking 
maybe 10,000 or more shots in total. The result 
is a sequence of very faint images in which the 
distorted light from each star produces a scat- 
tering of spots: the speckles. Computer pro- 
cessing recombines the speckles into one spot 
per star. Then all the exposures can be aligned 
and stacked to produce a final image with the 
worst of the atmospheric smearing removed. 

At the time, speckle imaging was well estab- 
lished but done mostly at optical wavelengths and used infrequently, 
because it was so computationally intensive. Ghez joined a Caltech group 
that was developing a speckle imager capable of working at infrared wave- 
lengths emitted from interstellar-dust-shrouded objects such as active 
galaxies: spirals and ellipticals that had dusty but exceptionally bright 
regions at their centres. The thinking was that both the dust and the light 
emissions came from vast quantities of stars and gas spiralling into black 
holes millions or billions of times the mass of the Sun. Ghez was given the 
job of helping to write image-analysis software to give the speckle-imaging 
device the highest possible resolution. “Oh, this sounds great,’ she says, 
remembering her reaction. “Black holes. Technique. I'm good” 

In the end, however, the imager couldn't quite see the nuclei of active 
galaxies. So for her PhD dissertation, Ghez turned to brighter targets: 
newborn stars in our Galaxy. Astronomers knew in broad terms that 
stars are born in thick, interstellar gas clouds, when gravity pulls the gas 
into hot, dense knots that ignite with thermonuclear fusion. But they 


“That's the 
evidence for 
a black hole. 
There are no 
alternatives 
we know of.” 
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didn’t understand why a substantial fraction of the stars in the Sun's 
neighbourhood are binaries: pairs of stars that orbit one another, often 
at close range. Were binaries born that way? Or did they somehow pair 
up later in life? Theory was not much help; the available star-formation 
models focused on single stars such as the Sun. 

Ghez wanted to resolve the issue with data: do binary stars form as 
binaries, yes or no? (She had developed a taste for questions that could 
be answered crisply. “If you ask mushy questions,’ she says, “you usually 
get mushy answers.’) Beginning in 1990, Ghez used her speckle-imag- 
ing techniques in two known star-forming regions to survey T Tauri 
objects — Sun-like stars just beginning their lives. With the higher reso- 
lution, she could see that many of the infant stars were indeed binaries, 
too young and too close together for the companions to have formed 
separately and then coalesced’. Ghez’s finding continues to have impli- 
cations for issues such as the search for extrasolar planets, because the 
complex gravitational fields in a binary system are thought to make it 
difficult for planets to form in the vicinity. 

Ghez focused on this topic exclusively until 1994, when she was hired at 
the University of California, Los Angeles, and gained access to a 10-metre 
instrument opened only a few years before at the Keck Observatory on 
Mauna Kea in Hawaii, and jointly owned by Caltech and the University 
of California. The biggest of a new generation of very big telescopes, Keck 
was ideal for catching the light of faint objects. But like all telescopes on 
the ground, its resolution was limited to what the atmosphere allowed. So 
Ghez applied her expertise in speckle imaging, realizing that its high reso- 
lution, combined with the telescope’s prodigious light- gathering power, 
would finally let her look at black holes. 

Astronomers had long suspected that a supermassive black hole lay at 
the centre of every galaxy; active galaxies were the ones that just happened 
to have above-average quantities of matter feeding the black hole. If that 
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was true, a giant black hole ought to lie at 
the centre of the Milky Way. That region 
is difficult to observe from Earth because 
there is a lot of interstellar gas and dust 
in the way and only certain wavelengths 
of radiation can make it through. But 
observations in y-rays, X-rays and radio 
and infrared emissions were consistent 
with the presence of gas moving at high 
velocities, and stars crowding together in 
high numbers. They had to be orbiting 
something massive and hard to see. 

But what? To remove any doubt about 
the nature of the central object, observ- 
ers would have to show that it was too 
small to be anything but a black hole. 
(The fundamental equations of gravity 
guarantee that, for a given mass, a black 
hole is smaller than any possible system 
made of ordinary matter.) 

Ghez and her team decided to tackle 
the problem with a straightforward 
but tedious strategy: track stars for 
years and decades as they orbit the 
central object. The radius and period 
of each orbit would give the central 
object’s mass, and the distance of clos- 
est approach would put an upper limit 
on its size. The astronomers started in 
1995 by mapping the positions of stars 
in a dense cluster near the suspected location of the black hole. Every 
year or so, they mapped the positions again. 

“With two points you draw a line and get the velocity,” says Ghez. 
“With three points, you actually believe your line?” And then you keep 
going. “You wait long enough, keep taking pictures” and you should be 
able to start tracing out the curves of the stars’ orbits. By 1999, Ghez and 
her colleagues had done exactly that with three stars’. 


SEEING THE LIGHT 
These observations showed the orbit in only two dimensions, pro- 
jected onto the plane of the sky. Getting the full orbit required the third 
dimension, the star’s motion towards and away from Earth, which meant 
measuring the spectrum of the light it emitted and watching how its 
velocity shifted the wavelength of the spectral lines — measurements 
that required more light than speckle imaging could supply. Ghez, how- 
ever, was already working with astronomers developing a more advanced 
technology called adaptive optics. The idea was to take the light gathered 
by the telescope and bounce it offa flexible mirror that could be continu- 
ously deformed to counteract distortions caused by the atmosphere. The 
technique wasted much less light than speckle imaging did, so it prom- 
ised higher resolution and the ability to capture the spectra of fainter 
objects. Ghez was doing proof-of concept observations, and was “right 
there and ready to snatch it up the minute it was ready to go’, she says. 
In June 2002, Ghez and her team used adaptive optics to observe one 
of their three stars, called SO-2, and found that by adding in older speckle 
data they could map its complete orbit’. The orbit took 16 years, implied 
a central mass of 4.1 million Suns and showed that the star’s closest 
approach to that mass was less than twice the distance from Pluto to 
the Sun — meaning that the central object was no larger than the Solar 
System. With that much mass in that small a space, says Ghez, “that’s 
the evidence for a black hole. There are no alternatives that we know of” 
That result was satisfying but not unexpected. Much more surprising 
was that S0-2’s spectrum showed it to be less than 10 million years old. 
No star orbiting close to a black hole should be that young, because the 
black hole’s tidal forces would shred any cloud of gas and dust before 
it could form. There was no good explanation for why the young stars 
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Andrea Ghez wants to find out what lies at the heart of our Galaxy. 


are there, says Thomas Prince, an 
astronomer at Caltech and Ghez’s 
first adviser, “and there still is not”. 

In subsequent years, as adaptive 
optics improved, Ghez and her team 
measured the velocities of thousands 
of stars in the Galactic Centre, and 
estimated the orbits of about two 
dozen; the latest is a star named 
S0-102, which has the shortest orbit 
yet at 11.5 years (ref. 2). Most of them 
also proved to be equally young and 
enigmatic. “It's beautiful work really,’ 
says Rosemary Wyse, an astrono- 
mer at Johns Hopkins University in 
Baltimore, Maryland, who studies the 
Milky Way. “It’s one of those pains- 
takingly technical tours de force that 
produces good science results.” 

Only one other team in the world is 
doing this type of work: Genzel's. Both 
groups have access to large telescopes 
and sophisticated adaptive-optics sys- 
‘Si tems, and the word is that competition 

ms is fierce — which neither Ghez nor 
Genzel denies, although they try to 
play it down. Perhaps more important 
is that they agree on the same meas- 
urements on the same stars, differing 
mainly in interpretations. 

The latest example concerns an object that might help to show how a 
black hole destroys its prey. Both Ghez and Genzel agree that it is hot, red 
and heading straight towards the Galactic Centre. Genzel says that it is a 
cloud of gas with the mass of three Earths, that it will be torn to smither- 
eens by the black hole and that in the process it will radiate X-rays fiercely”. 
Ghez maintains that it is hard to say what the object is; it could equally 
bea star moving through gas, in which case the black hole is unlikely to 
disrupt it. Genzel says that the object’s closest approach to the black hole 
will be this autumn; Ghez says maybe later and in any case the timing 
is uncertain. Avi Loeb, an astronomer at Harvard University in Cam- 
bridge, Massachusetts, agrees with both Ghez and Genzel that this kind 
of competition over difficult, expensive observations makes the resulting 
astronomy more credible. “Overall,” he says, “it's good for science.” 

Ghez is already looking to the next technologies she can adopt. She 
is on the scientific advisory committee of the Thirty Meter Telescope, 
which is due to start observations from Mauna Kea towards the end of 
the decade and will gather roughly ten times more light than Keck. She is 
also advising the developers ofa version of adaptive optics that promises 
even higher resolutions than are currently available. 

“Tm positioning myself? says Ghez. “I want to know how to use these 
systems, I want to know how to use the data, I want this all to work” Like 
other great scientists, says Thomas Soifer, an astronomer at Caltech and 
another of her early advisers, “she has this single-minded attitude of, 
‘Tm going to beat this problem into submission”. 

Ghez is 47 years old and the stars that she has been tracking since 1995 
have orbits ranging from tens to hundreds of years. How long does she 
plan on living? “I don’t know,’ says Ghez. “I figure the more fun I have, 
the longer I'll live.” m 
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Ann Finkbeiner is a freelance writer in Baltimore, Maryland. 
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SLOW 
SCIENCE 


The world’s longest- 
running experiments 
remind us that science is 
amarathon, not a sprint. 


BY BRIAN OWENS 


Ithough science is a long-term pursuit, research 

is often practised over short timescales: a 
discrete experiment or a self-contained project 
constrained by the length of a funding cycle. But 
some investigations cannot be rushed. To study human 
lifespans or the roiling of Earth’s crust and the Sun’s 
surface, for instance, requires decades and even centuries. 
Here, Nature takes a look at five of science’s longest- 
running projects, some of which have been amassing data 
continuously for centuries. Some generate hundreds of 
papers a year; one produces a single data point per decade. 
Experiments operating at this pace are challenged by 
shifting research priorities and technologies, and their 
existence is regularly threatened by funding droughts and 
changes in stewardship. But they are bound together by 
the foresight of the scientists who started them and the 
patience and dedication of those who 

carry the torch. If persistence predicts a 

long and healthy life — as one 90-year 

study of human longevity has suggested 

— then the scientists featured here 

could set some records themselves. 
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Galileo was drawing sunspots as early as 1613. 


AQQ YEARS sou 


Astronomers have been recording the appearance of sunspots ever 
since the telescope was invented more than 400 years ago; even Galileo 
recorded his observations. But early observers had no knowledge of 
what the dark patches on the Sun’s surface were, or of the magnetic 
fields that created them. That began to change when, in 1848, the Swiss 
astronomer Rudolf Wolf began making systematic observations and 
developed a formula that is still used today to calculate the interna- 
tional sunspot number, also known as the Wolf number, which gives a 
measure of how solar activity is changing over time. 

In 2011, Frédéric Clette became director of the Solar Influences 
Data Analysis Center, based at the Royal Observatory of Belgium in 
Uccle, which curates sunspot counts gleaned from photographs and 
hand drawings of the Sun’s surface made by more than 500 observers 
since 1700. 

The data are invaluable for predicting sunspot activity, says Leif 
Svalgaard, a solar physicist at Stanford University in California. The 
activity seems to wax and wane over the course of 11 years or so, and 
the streams of charged particles that the sunspots spray into space can 
affect satellites and electronics on Earth. The detailed records help 
researchers to understand why that cycle happens, and to refine predic- 
tions of particularly intense events. “The longer the time series is, the 
better we can check our theories,” Svalgaard says. Around 200 papers 
a year cite sunspot data, in fields extending beyond solar physics to 
geomagnetism, atmospheric science and climate science. 

But the enterprise runs largely on goodwill. Each month, the Belgian 
centre collates sunspot numbers from about 90 observers, two-thirds 
of them amateurs, who use small optical telescopes no more powerful 
than those available 200 years ago. And although it is a World Data 
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Centre recognized by the International Council for Science in Paris, it 
receives no funding from the organization. Clette works with one other 
part-time person to maintain the database, in addition to his ‘night-job 
as an astronomer at the Royal Observatory of Belgium. 

Still, says Clette, it is fascinating to ‘work’ with colleagues from 
hundreds of years ago. For instance, he says that even though Galileo's 
coverage of the Sun was spotty because Galileo was “busy with planets 
and other things”, the drawings are detailed enough to reveal infor- 
mation about the magnetic structure of the sunspot groups and the 
size and tilt of the star’s dipole. “You can extract from those drawings 
exactly the same information as from a drawing made today,’ he says. 

More than that, however, he is taken with his forebears’ foresight. 
They faithfully recorded what they saw, thinking that it could be use- 
ful later on, he says. “It’s a fundamental aspect of science,’ he says, “not 
worrying what will be the final result.” 


170 YEARS 


Although consistently active, every few thousand years, Mount 
Vesuvius erupts in spectacular style. The last time it did so, in aD 79, 
it consumed the city of Pompeii in the flames and before that, about 
3,800 years ago, it covered all of present-day Naples in hot gas and 
rock (see Nature 473, 140-141; 2011). The Vesuvius Observatory, the 
oldest volcano research station in the world, has been keeping an eye 
on its inhospitable subject since 1841, logging all the volcano’s seismic 
rumbles to try to spot approaching danger. Originally perched 600 
metres up the side of the volcano, far enough from the summit to 
be safe from ejected debris and high enough on a knoll to avoid the 
lava flows, the observatory has shaped the way that volcanology and 
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geology is done, says Marcello Martini, its current director. 

Macedonio Melloni, the observatory’s first director, did pioneer- 
ing work on the magnetic properties of lava that was crucial to later 
studies of palaeomagnetism — the history of Earth’s magnetic field as 
recorded in rocks. In 1856, its second director, Luigi Palmieri, invented 
the electromagnetic seismograph, which was much more sensitive to 
ground tremors than previous machines and allowed him to predict 
eruptions. Under Palmieri and subsequent directors the observatory 
contributed to the development of much of the instrumentation used 
to monitor volcanoes worldwide. In the early twentieth century, for 
example, Giuseppe Mercalli developed the scale still used today to 
classify volcanic activity. 

But the building itself no longer has the same role. “In the early 
stages it was important to be as close to the action as possible, but 
that’s not necessarily the case any more,’ says Haraldur Sigurdsson, a 
volcanologist at the University of Rhode Island in Kingston. Most of 
the monitoring is now done by remote, ground-based sensors, with 
the data sent back to the lab of the National Institute of Geophysics 
and Volcanology in Naples. The original buildings were turned into 
a museum in 1970. 

In addition to informing scientific theory, the observations are used 
to predict trouble and protect the public — as they did successfully in 
1944. The Naples lab, where scientists are on duty 24 hours a day, also 
keeps an eye on Mount Stromboli, on an island north of Sicily; the 
Campi Flegrei caldera west of Naples; and the island of Ischia. Sigurds- 
son, says, however, that the future of volcanology lies not with putting 
sensors on the volcanoes already known to be dangerous, but with 
satellite-based radar that can study ground deformation everywhere 
and pick out the risky regions irrespective of geologists’ expectations. 
“We should be moving towards an internationally coordinated system 
of volcano monitoring that is not tied down to bricks and mortar on 
the side of a volcano, but looking at it in a really comprehensive way 
globally,” he says. 


Now a museum, the original building of the Vesuvius Observatory was used to monitor volcanic activity from the side of the mountain. 
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Rothamsted has been home to experiments on the effects of fertilizer on the yield of wheat since 1843. 


/ () Y A B S HARVESTING 

DATA 
The stewards of long-term research projects are keen to maintain the 
integrity of the work, but also to keep it relevant. That is the case for 
Andy Macdonald who, in 2008, inherited a set of agricultural experi- 
ments that have been testing the effects of mineral fertilizers and 
organic manure on crop production since 1843. 

Started by the fertilizer magnate John Lawes on the grounds of his 
country estate at Rothamsted, north of London, these studies have been 
used to test how nitrogen, phosphorus, potassium, sodium, magnesium 
and farmyard manure affect the yields of several staple crops, including 
wheat, barley, legumes and root crops. 

“After 20 or 30 years, many of the basic questions about the relative 
importance of different fertilizers were pretty well answered,’ says Mac- 
donald, manager of the ‘classical experiments’ now run at Rothamsted 
Research. Nitrogen has the largest effect, followed by phosphorus. So 
the experiments are periodically updated to test new ideas and keep 
them relevant to current farming practice. In 1968, for example, the 
long-strawed cereal crops that had been grown since the experiments 
started were replaced by the higher-yielding short-strawed cereal crops 
being adopted by farmers. Macdonald says that these new crops turned 
out to need more fertilizer than the traditional cultivars because of 
the additional nutrients they were extracting from the soil, so farmers 
had to adapt. 

“Rothamsted is the granddaddy of long-term agricultural research,’ 
says Phil Robertson, director of the W. K. Kellogg Biological Station, 
a long-term agricultural research site at Michigan State University in 
Hickory Corners. The unbroken chain of data is invaluable, he says. 
Not only is Rothamsted able to study environmental and biological 
trends — such as carbon storage in soil or the effects of invasive species 
— that become apparent only over long timescales, but it also provides 
a platform for shorter studies on, say, nitrate loss in soil. 

The Rothamsted archive holds about 300,000 preserved plants and 
soil samples that have been collected since the experiments began. 
In 2003, scientists extracted the DNA of two wheat pathogens from 
archived samples dating back to 1843 and showed that industrial 
sulphur dioxide emissions affected which one was dominant’. 

It can bea struggle to keep funding agencies interested. Rothamsted 
gets by on a mixture of government funding, competitive grants and 
a trust fund set up by Lawes before he died. “As a funder you have to 
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commit to maintaining observations even during periods where there 
may not be any exciting results,” says Robertson, who was involved 
in setting up the US Department of Agriculture’s Long Term Agro- 
Ecosystem Research network last year. Macdonald and his team take 
pride in their history. “I think back sometimes to John Lawes,’ Mac- 
donald says. “I feel a great responsibility to ensure that the experiments 
are handed on in good condition for the next generations. They're not a 
museum piece, they're part of our living scientific community.” 


Lewis Terman started one of the lengthiest studies of human development. 


J0 YEARS 


WATCHING 
GENIUS BLOSSOM 


In 1921, psychologist Lewis Terman of Stanford University in 
California started tracking more than 1,500 gifted children — as iden- 
tified by the Stanford-Binet IQ test that he developed — born between 
1900 and 1925. It was one of the world’s first longitudinal studies and it 
has now accrued the longest in-depth record of human development, 
having tracked the participants for nine decades — looking at their 
home lives, education, interests, abilities and personality. 

One of Terman’s main goals in his ‘Genetic Studies of Genius’ was to 
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disprove the then-common assumption that gifted children were sickly, 
socially inept and not well-rounded. But even by the standards of his 
day, the study design was plagued with problems. His selection method 
was haphazard, administering the tests based largely on recommenda- 
tions from teachers, and the sample was far from representative (more 
than 90% were white and upper or middle class, and Terman even 
enrolled his own children). What is more, Terman skewed the very life 
outcomes he was trying to study, writing letters of recommendation for 
his “Termites, as the participants became known, and helping several 
to get into Stanford. 

By tracking the children into adulthood, Terman showed that they 
were just as healthy and well-adjusted as the general population and that 
they generally grew into successful, happy adults. And, as it progressed, 
researchers tweaked the study to try to overcome some of its flaws. 

In the 1980s, for example, George Vaillant, a psychologist at Harvard 
Medical School in Boston, Massachusetts, began using Terman data 
to supplement his own long-running study of adult development and 
started collecting the death certificates of the Termites. Using these 
records, Howard Friedman, a psychologist at the University of Cali- 
fornia, Riverside, was able to come up with one of the Terman study’s 
most significant findings. He showed that conscientiousness — namely 
prudence, persistence and planning — measured in both childhood 
and adulthood is a key psychological factor predicting longevity, and 
was associated with as many as six or seven extra years of life”. “It would 
not have been easy to discover this without a lifespan longitudinal pro- 
spective data set,’ says Friedman. 

Longitudinal studies evolve according to scientific fashions, says 
Laura Carstensen, director of the Stanford Center on Longevity. New 
investigators will add measures and modify or discard those that they 
think are no longer interesting or are obsolete. “Today we would meas- 
ure emotional well-being, for example, in a very different way from in 
1900,” she says. So in many ways “looking at a longitudinal data set is 


almost like writing a history of psychology”. 
WAITING FOR 


O VEARS sess: 


On his second day of work at the University of Queensland in Brisbane, 
Australia, in 1961, physicist John Mainstone came across a quirky little 
experiment that had been quietly running in a cupboard for 34 years. 
Fifty years later, he is still looking after it — and still waiting to witness 
its most dramatic activity. 

The pitch-drop experiment started when Thomas Parnell, the 
university's first professor of physics, set up a demonstration for his 
students to show that a sample of pitch, a black tar distillate that is brit- 
tle enough to shatter with a hammer when cold, will act like a liquid 
and flow through a funnel, dripping out of the bottom like the world’s 
slowest hour glass. It did, at a rate of about one drop every 6-12 years. 
Mainstone expects — cautiously — the ninth drop to fall sometime 
towards the end of this year. 

The experiment is not exactly a hotbed of discovery. In 86 years, it 
has yielded exactly one scientific paper’, which calculated that the pitch 
was 230 billion times more viscous than water. And in 2005, it earned 
the dubious distinction of an Ig Nobel prize (see Nature 437, 938-939; 
2005), a cheeky parody of the Nobels. 

There is, however, still some science to be gleaned. No one has ever 
seen a drop fall — a webcam recording the experiment failed just when 
the most recent drop fell, in November 2000 — so exactly what happens 
when the drop detaches from the body of pitch above is unknown. It 
will also take another few decades to tease out how weather, the intro- 
duction of air conditioning and the vibrations from renovation work 
in the building influence the drip rate. 

But Mainstone says that the experiment’s value lies not in its science, 
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The pitch-drop experiment has yielded just eight drops since 1927. 


but in its historical and cultural impact: it has inspired sculptors, poets 
and writers to muse about the passage of time and the pace of modern 
life. It also provides a link to scientific history, and a sense of constancy. 
“It's going about its business while the world is going through all sorts 
of turmoil,” Mainstone says. And with a great deal of pitch left in the 
funnel, it has the potential to serenely ignore that turmoil for another 
150 years or so. Luckily, 78-year-old Mainstone has already convinced 
a younger colleague to watch over the experiment after he is gone. m 


Brian Owens is a freelance writer in New Brunswick, Canada. 
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Sustainable development 
goals for people and planet 


Planetary stability must be integrated with United Nations targets to fight 
poverty and secure human well-being, argue David Griggs and colleagues. 


he United Nations Rio+20 summit 
To Brazil in 2012 committed govern- 
ments to create a set of sustainable 
development goals (SDGs) that would be 
integrated into the follow-up to the Millen- 
nium Development Goals (MDGs) after 
their 2015 deadline. Discussions on how to 
formulate these continue this week at UN 
headquarters in New York. 
We argue that the protection of Earth's 


life-support system and poverty reduction 
must be the twin priorities for SDGs. It is not 
enough simply to extend MDGs, as some are 
suggesting, because humans are transform- 
ing the planet in ways that could undermine 
development gains. 

As mounting research shows, the stable 
functioning of Earth systems — including 
the atmosphere, oceans, forests, waterways, 
biodiversity and biogeochemical cycles — is 
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a prerequisite for a thriving global society. 
With the human population set to rise to 
9 billion by 2050, definitions of sustainable 
development must be revised to include the 
security of people and the planet. 

Defining a unified set of SDGs is challeng- 
ing, especially when there can be conflict 
between individual goals, such as energy 
provision and climate-change prevention. 
But we show here that it is possible. By > 
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A UNIFIED FRAMEWORK 


A set of six sustainable development goals (SDGs) follow from combining the Millennium 
Development Goals (MDGs) with conditions necessary to assure the stability of Earth's systems. 


NEW PARADIGM 


Earth’s life- 
support system 


Society 


Economy 


People 


MDGs start 


> combining the MDGs with global envi- 
ronmental targets drawn from science and 
from existing international agreements, we 
propose six SDGs with provisional targets 
for 2030. 


ENTER THE ANTHROPOCENE 

Since 2000, the MDGs have focused on reduc- 
ing extreme poverty in developing countries. 
But pursuing a post-2015 agenda focused 
only on poverty alleviation could undermine 
the agenda's purpose. Growing evidence and 
real-world changes convincingly show that 
humanity is driving global environmental 
change and has pushed us into a new geologi- 
cal epoch — the Anthropocene’. 

Further human pressure risks causing 
widespread, abrupt and possibly irrevers- 
ible changes to basic Earth-system pro- 
cesses. Water shortages, extreme weather, 
deteriorating conditions for food produc- 
tion, ecosystem loss, ocean acidification 
and sea-level rise are real dangers that 
could threaten development and trig- 
ger humanitarian crises across the globe. 
Growing affluence and the right to develop- 
ment among the world’s poor demand that 
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PLANETARY MUST-HAVES 
Materials use 
Clean air 
Nutrient (N and P) cycles 
Hydrological cycles 
Ecosystem services 
Biodiversity 
Climate stability 
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SDGs begin 


people of all nations make the transition to 
sustainable lifestyles. 

By coordinating actions internationally, 
SDGs can address these risks. The MDGs 
have shown that a goal-setting approach 
raises both public and policy support and 
channels funds effectively towards urgent 
global problems’. However, the political 
reluctance to go beyond merely extending 
the MDGs is a concern. 

The targets for the SDGs must be measur- 
able, based on the latest research and should 
apply to developed and developing coun- 
tries. First, however, we need to reframe the 
UN paradigm of three pillars of sustainable 
development — economic, social and envi- 
ronmental — and instead view it as a nested 
concept. The global economy services soci- 
ety, which lies within Earth’s life-support 
system. The definition of sustainable devel- 
opment, as laid out in the 1987 report from 
the UN World Commission on Environ- 
ment and Development (the Brundtland 
Commission), should therefore be rede- 
fined to “development that meets the needs 
of the present while safeguarding Earth’s 
life-support system, on which the welfare 
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NEW DEFINITION 


Sustainable development in the 
Anthropocene: “Development that meets 
the needs of the present while 
safeguarding Earth’s life-support 
system, on which the welfare of current 
and future generations depends.” 


of current and future generations depends”. 
To set appropriate goals and targets, 
environmental conditions have to be 
identified that enable prosperous human 
development and set tolerable ranges for 
the biosphere to remain in that state. The 
extraordinarily stable Holocene epoch that 
allowed our ancestors to develop agricul- 
ture and modern societies during the past 
10,000 years provides a scientific reference 
point. Indeed, these are the only conditions 
we know that can support modern life. 
Building on decades of research, a 2009 
analysis defined planetary boundaries 
which would be unsafe to transgress for 
nine Earth-system processes’: climate 
change; rate of biodiversity loss (terrestrial 
and marine); interference with the nitrogen 
and phosphorus cycles; stratospheric ozone 
depletion; ocean acidification; global fresh- 
water use; change in land use; chemical pol- 
lution; and atmospheric aerosol loading. 
Adapting this planetary boundaries work, 
and using recent credible scientific studies 
and existing international processes — such 
as the United Nations Framework Conven- 
tion on Climate Change — we extracted a 


SIX SUSTAINABLE DEVELOPMENT GOALS 


Some provisional targets for 2030 


© Goal 1: Thriving lives and livelihoods. End 
poverty and improve well-being through 
access to education, employment and 
information, better health and housing, and 
reduced inequality while moving towards 
sustainable consumption and production. 
This extends many targets of the 
Millennium Development Goals (MDGs) on 
poverty, health and urban environments 
and applies them to developed nations. It 
should include targets on clean air that build 
on World Health Organization guidelines for 
pollutants such as black carbon®; reductions 
in emissions of stratospheric ozone-depleting 
substances in line with projections from the 
Montreal Protocol; critical loads for man-made 
chemical compounds and toxic materials; and 
sustainable practices for extraction, use and 
recycling of scarce minerals and metals and 
other natural resources. 


© Goal 2: Sustainable food security. 

End hunger and achieve long-term food 
security — including better nutrition — 
through sustainable systems of production, 
distribution and consumption. 

The MDG hunger target should be extended 
and targets added to limit nitrogen and 
phosphorus use in agriculture*”®. Nutrient- 
use efficiency should improve by 20% by 
2020; no more than 35 million tonnes of 
nitrogen per year should be extracted from the 


list of sustainability ‘must-haves’ for human 
prosperity (see ‘A unified framework). 

We combined these with the MDG 
targets, updated and extended for 2030, 
to produce six SDGs: thriving lives and 
livelihoods, sustainable food security, 
sustainable water security, universal clean 
energy, healthy and productive ecosystems, 
and governance for sustainable societies 
(see ‘Some provisional targets for 2030’). 

The driving principles remain: reduc- 
ing poverty and hunger, improving health 
and well-being and creating sustainable 
production and consumption patterns. A 
goal of improving lives and livelihoods, for 
example, would promote sustainable access 
to food, water and energy while protecting 
biodiversity and ecosystem services. 

None of this is possible without changes 
to the economic playing field*. National 
policies should, like carbon pricing, place a 
value on natural capital and a cost on unsus- 
tainable actions. International governance 
of the global commons should be strength- 
ened, for example through binding agree- 
ments on climate change, by halting the loss 
of biodiversity and ecosystem services and 


atmosphere; phosphorus flow to the oceans 
should not exceed 10 million tonnes a year; 
and phosphorus runoff to lakes and rivers 
should halve by 2030. 


© Goal 3: Sustainable water security. 
Achieve universal access to clean water 
and basic sanitation, and ensure efficient 
allocation through integrated water- 
resource management. 

This would contribute to MDG health 
targets, restrict global water runoff to less 
than 4000 cubic kilometres a year and limit 
volumes withdrawn from river basins to no 
more than 50-80% of mean annual flow>”. 


© Goal 4: Universal clean energy. Improve 
universal, affordable access to clean energy 
that minimizes local pollution and health 
impacts and mitigates global warming. 

This contributes to the UN commitment 
to sustainable energy for all, and addresses 
MDG targets on education, gender equity and 
health. To ensure at least a 50% probability of 
staying within 2°C warming", sustainability 
targets should aim for global greenhouse- 
gas emissions to peak in 2015-20, drop by 
3-5% a year until 2030, and fall by 50-80% 
by 2050. 


© Goal 5: Healthy and productive 
ecosystems. Sustain biodiversity and 


by addressing other sustainability concerns. 
The SDG framework manages trade-offs 
and maximizes synergies between targets, 
and can be implemented from international 
to city scales. It integrates social, economic 
and environmental dimensions and pro- 
vides guidance for humanity to pros- 
per in the long term. A small number of 
goals is essential for 


“None of this focus; others could 
is possible be added but should 
without build on the core six. 
changes to There are many 
the economic gaps and uncertain- 
playing field.” tiesin our knowledge 


of global environ- 

mental risks and how to enable societies to 
become resource-efficient, sustainable and 
wealthy. Research initiatives such as Future 
Earth, a ten-year programme coordinated 
by the International Council for Science’, 
are needed to refine targets and provide 
sustainable solutions for human well-being. 
But the first step is for policy-makers to 
embrace a unified environmental and social 
framework for the SDGs, so that today’s 
advances in development are not lost as our 
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ecosystem services through better 
management, valuation, measurement, 
conservation and restoration. 

This combines the MDG environmental 
targets with 2030 projections of the Aichi 
Targets adopted by the Convention on 
Biological Diversity (see www.cbd.int/sp/ 
targets). Extinctions should not exceed ten 
times the natural background rate. At least 
70% of species in any ecosystem and 70% 
of forests should be retained. Aquatic and 
marine ecosystems should be managed 
to safeguard areas crucial for biodiversity, 
ecosystem services and fisheries. 


© Goal 6: Governance for sustainable 
societies. Transform governance and 
institutions at all levels to address the other 
five sustainable development goals. 

This would build on MDG partnerships 
and incorporate environmental and social 
targets into global trade, investment and 
finance’. Subsidies on fossil fuels and policies 
that support unsustainable agricultural and 
fisheries practices should be eliminated by 
2020; product prices should incorporate 
social and environmental impacts. National 
monitoring, reporting and verification 
systems must be established for sustainable- 
development targets; and open access to 
information and decision-making processes 
should be secured at all levels. 


planet ceases to function for the benefit of a 
global population. = 
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Trucks from an illegal logging operation transport timber along a partially paved highway that cuts deep into the Brazilian Amazon. 


A global map for 
road building 


Roads are proliferating across the planet. Located and designed wisely, they can help 
rather than harm the environment, argue William F. Laurance and Andrew Balmford. 


early 100,000 kilometres of road 
Nee the Amazon rainforest. 

That is enough to circle the Earth 
two-and-a-half times. Even in formerly 
remote corners of the world — the Congo 
region, Borneo, Siberia, Namibia — road 
networks are expanding apace. This global 
road rush is being driven by escalating 
demand for minerals, fossil fuels, timber 
and arable land, and by developing nations 
as they work to improve transport and 
energy infrastructures’. 

Roads can bring myriad environmental 
problems. In the Amazon, new roads 
in forested areas often promote illegal 
colonization of undisturbed areas, as well as 
mining, hunting and land speculation. More 
than 95% of deforestation, fires and atmos- 
pheric carbon emissions in the Brazilian 
Amazon occur within 50 kilometres of a 
road’. 

Yet the effects ofa road vary depending 
on its location and design. A paved high- 
way slicing through a large forest tract 
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can precipitate an environmental disaster. 
Conversely, in places where farming is 
already widespread and intact habitat is 
scarce, and where there are sizeable gaps 
between current and potential farm yields, 
building high-quality roads can improve 
farms efficiency, increase their profitability 
and limit their environmental impact. 

We propose that environmental scientists, 
planners, road engineers and other stake- 
holders carry out a global ‘road-zoning’ 
project to map areas that should remain 
road-free and those in which transport 
urgently needs improving. 


LAND-USE PRESSURES 

The twenty-first century will bring pro- 
found changes in land use, many of them 
unavoidable and even desirable. Food 
demand is projected to double by 2050. 
Under current farming practices, this would 
require an additional 1 billion hectares of 
farming and grazing land’ — an area the 
size of Canada. 
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Given escalating demand for food, fibre 
and biofuels, researchers and policy-makers 
have focused on improving agriculture 
through the use of modern crop varieties, 
fertilizers, pest control and better transport. 
The hope is that such technologies will allow 
farmers to increase yields without using too 
much extra land*. 

In practice, however, by making farming 
more profitable, yield improvements can 
encourage the conversion of land for more 
crop and livestock production’. For instance, 
as a result of high demand and innovations 
suchas new cultivars, oil-palm plantations are 
rapidly expanding across the tropics — often 
at the expense of biodiversity-rich rainforests. 

We are convinced that increasing agricul- 
tural yields will lessen the impact of farming 
on natural ecosystems only if coupled with 
effective land-use planning’. Roads, which 
profoundly influence the footprint of human 
activities, are a key element of such planning. 

Numerous factors — economic, politi- 
cal, social and practical — influence road 
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planning. Buta few key principles could help 
to guide the location and design of roads. For 
example, the most environmentally damag- 
ing roads are those that penetrate relatively 
pristine regions. This is because the proba- 
bility that a patch of land will be cleared rises 
dramatically if an adjacent area has already 
been cleared’. For this reason, the first cut 
into a forest is also the most crucial. 
Furthermore, paved highways typically 
have much larger environmental impacts 
than unpaved roads, especially in wet 
environments in which unpaved roads can 
become seasonally impassable. In Brazil, 
for instance, the Belém-Brasilia Highway, 
completed in the early 1970s, now cuts a 
400-kilometre-wide swathe of cleared forest 
and secondary roads through the Amazon’. 
In certain contexts, however, road 
building, or improvements such as paving, 
can be socially and environmentally 
beneficial. Often, agriculture follows roads 
created for other purposes, such as mining 
or logging. This can result in the expansion 
of farms into places with marginal soils or 
climates, or into locations that are too far from 
markets to be cost-effective. Conversely, well- 
planned roads can increase farmers’ access 
to markets, reducing waste and improving 
profits. Anecdotal evidence indicates that 
ongoing road improvements in parts of 
sub-Saharan Africa are gradually raising rural 
farmers access to fertilizers and increasing 
their capacity to transport crops to markets. 
Several studies suggest that road 
improvements in areas suited to agricul- 
tural development can attract migrants away 
from vulnerable areas, such as the edges of 
pristine forests®. Concentrating people in 
carefully defined areas is beneficial, because 
the relationship between deforestation and 
population density is nonlinear. In gen- 
eral, migrants entering an already cleared 
area remove much less forest than those 


who arrived first’; latecomers may include 
merchants instead of farmers and loggers, 
for instance, or farmers who buy up prev- 
iously established plots. 

It is much easier for policy-makers to 
influence patterns of road development than 
to affect more socially complex problems such 
as population growth and overconsumption. 
Roads can be re-routed, cancelled or delayed. 
Large road projects are often funded by 
taxpayers, investors or international donors 
who can be surprisingly responsive to 


environmental concerns. For example, if 


corporations that build environmentally bad 
roads are publicly named, they can lose cus- 
tomers and shareholders. Concord Pacific, a 
Malaysian logging corporation, was publicly 
castigated in the early 2000s for bulldozing a 


180-kilometre-long road into the highlands of 


Papua New Guinea — ostensibly to aid local 
communities. After the company grabbed 
more than US$60 million in illegal timber’, 
it was fined $97 million by the national court 
of Papua New Guinea. 


TRANSPORT PLANNING 
We believe that a collaborative, global 
zoning exercise is needed to identify where 
road building or improvement should bea 
priority, where it should be restricted and 
where existing roads should be closed. A 
multidisciplinary team could integrate 
and standardize satellite data on intact 
habitats with information on transport 
infrastructure, agricultural yields and 
losses, biodiversity indicators, carbon stor- 
age and other relevant factors. Much of this 
information has been recorded or can be 
extrapolated from current data sources. 
The next step would be to make the 
results of these analyses readily available as 
high-resolution, colour-coded maps to aid 
policy-makers, conservation organizations 
and others in planning roads. This mapping 


Satellite image of deforestation along the Trans-Amazonian highway in Brazil. 
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exercise could be repeated, and at finer 
scales, as circumstances on the ground 
change or data sources improve over time. 
Beyond limiting habitat destruction, a 
global road-zoning scheme could safeguard 
rare environments and areas rich in endemic 
species. It could prove useful for a number 
of projects already planned or under way, 
such as the Tanzanian government's plan 
(which is being legally challenged) to bisect 
Serengeti National Park with a highway 
that could disrupt the area’s world-famous 
migration of wildebeest and other wildlife, or 
the Ladia Galaska road network under con- 
struction in northern Sumatra, which will cut 
through hundreds of kilometres of rainforest 
and protected areas when complete. 
For transport projects that have high 
environmental costs 


“A global | but seem unavoid- 
road-zoning able, such as Brazil’s 
schemecould — Manaus-Porto Velho 
safeguard highway — which 
areas rich is now under con- 
in endemic struction and has the 


species.” potential to speed set- 
tlers and land specu- 
lators into the heart of the Amazon when 
complete — alternatives such as railways or 
river transport might be effective compro- 
mises. Trains and boats move people and 
products but limit the human footprint by 
stopping only at specific places”. 

Keeping roads out of natural areas is one 
of the most tractable and cost-effective ways 
to protect crucial ecosystems. In a world 
struggling to conserve nature as land-use 
pressures intensify, managing transport 
networks is where the rubber meets the 
road. = 
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“Mon premier gorille” from adventurer Paul Du Chaillu’s colourful 1863 account of his travels in Africa. 


EVOLUTION 


Beastly surprises 


Henry Nicholls relishes the history of the ‘ape man’ who braved the dangers of 
Gabon — and of Victorian England. 


nthe middle of the nineteenth century, the 

famed Victorian publisher John Murray 

released a title that took Britain by storm. 
It shot to the top of the best-seller list, divid- 
ing the scientific elite and raising troubling 
questions about the origins of humankind. 
The book was Explorations and Adventures 
in Equatorial Africa by Paul Du Chaillu. 

Never heard of it? That is perhaps due to 
our obsession with another of Murray’s titles, 
On the Origin of Species. It took decades for a 
true appreciation of Charles Darwin's work 
to begin to emerge; but Explorations and 
Adventures had a much more immediate and 
wide-reaching impact. Its enigmatic author 
became an overnight sensation. Now, jour- 
nalist Monte Reel goes to the heart of this 
human drama in Between Man and Beast, 
“the story of a nervy young man who rises, 
and occasionally falls, in a quest to construct 
a heroic destiny from scratch”. 

The first scientific description of the gorilla 
was given in 1847 to the Boston Natural His- 
tory Society, but was only read by a handful 
of the zoological elite. The tribes sharing a 
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forest with the ani- 
mal known locally as 
njena lived in fear of it, 
believing it lurked in 
trees, ambushing and 
choking humans. In 
1856, Du Chaillu, the 
twenty-something son 
of a French merchant 
living in West Africa, 
set out to explore the 
interior of Gabon — 
then a cartographic 
void allegedly rife with 
cannibals and njena. 
Three years later, 
Du Chaillu emerged 
from the malarious 
jungle with stories 
aplenty and the stuffed 


Between Man and 


Beast: An Unlikely 
Explorer, the 
Evolution Debates, 
and the African 
Adventure that 
Took the Victorian 
World by Storm 
MONTE REEL 
Doubleday: 2013. 
352 pp. $26.95 


skins of more than 20 western lowland goril- 
las. After a brief, largely unsuccessful, attempt 
to exhibit them in New York, he took his haul 
to London. 

Reel’s build-up to this moment is ingenious. 
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Over the course of several tight, deliciously 
crafted chapters that flit between Du Chaillu 
in Africa and the pioneering naturalist Rich- 
ard Owen in London, Reel sets these two 
outwardly incongruous characters on a col- 
lision course. When Du Chaillu turned up in 
London, his gorillas were the cargo of Owen's 
dreams. A sensational exhibition of these 
dramatically stuffed beasts, he felt, would 
only support his view (based on differences 
in brain anatomy) that humans could not be 
descended from apes. Owen led the young 
gorilla hunter into the innermost chambers 
of Britain's scientific establishment, and the 
heart of Victorian culture itself. 

Reel revels in this wider context. He 
exposes the underbelly ofa still-rampant slave 
trade, takes in a United States on the brink of 
the civil war, and conjures up the infamous 
Oxford meeting of the British Association 
for the Advancement of Science in 1860. 
There, Thomas Huxley clashed with both 
Owen and Samuel Wilberforce (the bishop 
of Oxford) over Darwin's fresh-off-the-press 
theory. Reel evokes the “soupy fog” of gaslit 
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London, with its peculiar blend of high soci- 
ety and “smudged underclass” of “ragpickers, 
costermongers, night-soil men, mud larks, 
shoeblacks, lamplighters, thimbleriggers”. 
Whenever Du Chaillu’s compelling drama 
touches on issues of race and racism, privi- 
lege and class, amateurism and professional- 
ism, celebrity and reputation, Reel takes an 
enriching detour into the details. 

With the runaway success of Explorations 
and Adventures in Equatorial Africa and 
packed lectures at London's elite scientific 
institutions, Du Chaillu and his gorillas 
became an instant cultural phenomenon. 
They inspired a deluge of (mostly favour- 
able) book reviews, plenty of satirical com- 
ment, cartoons, a poem, a sell-out tour ofa 
hirsute ‘freak’ described as “the facsimile of 
the gorilla” and a popular song and dance 
known as “The Gorilla Quadrille. They 
piqued the interest of literary lights such 
as Charles Dickens, who in 1861 wrote two 
articles about gorillas in the weekly maga- 
zine he edited, All the Year Round. In time, 
Du Chaillu became an inspiration for Arthur 
Conan Doyle's action-adventure novel The 
Lost World, Jack London's wilderness stories, 
Edgar Rice Burrough’s tales of Tarzan and 
Merian C. Cooper’s 1933 film King Kong. 

But the sensational colour Du Chaillu 
used to capture the public imagination 
began to arouse the suspicion of establish- 
ment figures. Perhaps his account was not 
entirely truthful — or was even completely 
fabricated. Some wondered whether he 
might have just bought the skins and skipped 
the jungle adventure altogether. “He wasn’t 
just walking the thin line between credibility 
and bravado; he was dancing on it,” writes 
Reel. “Every statement of fact in his book 
was now vulnerable to a contagion of doubt.” 

Du Chaillu responded by mounting a sec- 
ond expedition, vowing to collect not only 
more gorillas but every conceivable snip- 
pet of scientific data. Although disastrous, 
it proved just enough to save his reputation. 

Ironically, Between Man and Beast itself 
is not immune from a contagion of doubt. 
Reel asserts that every scene and quotation 
“is constructed from historical documents” 
and “physical descriptions and atmospheric 
details are rooted in factual evidence”. This is 
undoubtedly an honest statement, but seri- 
ous historians will be uncomfortable with 
Reel’s intense, near-filmic reconstruction 
of historical events, peppered with detail 
that is either unverifiable or based solely on 
Du Chaillu’s own recollections. For everyone 
else, however, this is what makes the book 
a supremely entertaining, enlightening and 
memorable read. m 


Henry Nicholls is a freelance journalist 
based in London. His most recent book is 
The Way of the Panda (Profile Books, 2010). 
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Books in brief 


The Scramble for the Amazon and the Lost Paradise of Euclides da 
Cunha 

Susanna Hecht UNIV. CHICAGO PRESS 600 pp. $45 (2013) 

Historian Susanna Hecht charts a brutal nineteenth-century 
resource grab: the US and European rubber raids in the Amazon. 
Woven through is the story of great Brazilian environmental 

writer, geographer and engineer Euclides da Cunha, whose survey 
expedition down the Amazon River exposed the raids’ grim ecological 
and human toll. Da Cunha’s account, Lost Paradise, masterfully mixes 
biology, geography and philosophy, but remains unfinished: he died 
violently. A journey into South America’s heart of darkness. 


RICHARD MABEy 


Turned Out Nice Again: On Living With the Weather 

Richard Mabey PROFILE Books 160 pp. £8.99 (2013) 

As denizens of an island prone to flooding, gales, drizzle and the 
occasional halcyon day, Britons obsess about meteorological 
vagaries. Nature writer Richard Mabey celebrates this preoccupation 
with the weather that, glue-like, bonds a nation. In a lyrical 90 pages, 
he takes us from freak events such as the January 1940 ice storm 
that saw cats “iced to branches”, to retinal detachment during 
extreme low fronts and the soggy impact of climate change. Mabey’s 
veerings — from forecaster-shamans to naturalist Gilbert White’s 
frozen pisspot — are as gloriously mercurial as the British weather. 


Noise: A Human History of Sound and Listening 

David Hendy PROFILE Books 402 pp. £16.99 (2013) 

An acoustic history spanning some 100,000 years, this BBC Radio 4 
tie-in is a smorgasbord of sound. David Hendy leads us into the 
‘voiceprints’ of human prehistory through the music of wind-torn 
forests, calling animals and stridulating insects that must have 
inspired our early ancestors. He then tackles the roots of oratory; 
the aural assault course that was ancient Rome; the specialized 
‘smart’ acoustics of churches and mosques; noise anxiety in an 
urbanizing, militarized world; the rise of recording technology; and 
the contemporary search for silence in the soundscape. 


Forecast: What Physics, Meteorology, and the Natural Sciences 
Can Teach Us About Economics 

Mark Buchanan BLOOMSBURY 272 pp. £18.99 (2013) 

Physicist Mark Buchanan rethinks the wobbly phenomenon of 
international finance. Markets, he argues, do not self-regulate. They 
are as vulnerable to severe ‘weather events’ as Kansas: prone, like 
any system, to positive feedbacks in which small variations can 
lead to sudden change. Buchanan argues for a physics-flavoured 
disequilibrium approach that would improve forecasts through the 
creation of models that are fine-tuned to recognized shifts in trade, 
speculation and other financial activities. 


Red Rover: Inside the Story of Robotic Space Exploration, from 
Genesis to the Mars Rover Curiosity 

Roger Wiens BASIC Books 256 pp. $25.99 (2013) 

Space hardware is in boom mode, points out geochemist — and 
principal investigator for Curiosity’s ChemCam instrument — Roger 
Wiens. Meshing the blow-by-blow science with memoir, his account 
begins with childhood rocket modelling and telescope building, 
moves on to the 2001 Genesis mission and culminates with the 
financial and technological rollercoaster in the lead-up to Curiosity. 
A tribute to human ingenuity in NASA's “faster, better, cheaper” era. 


21 MARCH 2013 | VOL 495 | NATURE | 311 


>) 2013 Macmillan Publishers Limited. All rights reserved 


| COMMENT | BOOKS & ARTS 


ag g Z 


A baby bonobo at the Lola Ya Bonobo sanctuary in the Democratic Republic of the Congo. 


PRIMATOLOGY 


A wild empathy 


Christopher Boehm relishes a wide-ranging 
assessment of primate morality. 


and the Atheist, is both an exceptionally 
good read and a tour de force of schol- 
arship. In it, de Waal states his argument for 
the evolution of human empathy with the 
sophistication of a well-grounded, risk-taking 
scientist who can venture into philosophy. 
De Waal draws on his own ethological 
research with chimpanzees and bonobos, 
as well as on biology and evolutionary psy- 
chology, to probe empathy asa key precursor 
for moral behaviour. The origins of human 
morality are not new territory for the pri- 
matologist and ethologist, who engaged 
with the question in Peacemaking Among 
Primates (Harvard University Press, 1990) 
and who, with Jessica Flack, has examined 
the evolutionary building blocks of morality. 
The novel element in The Bonobo and the 
Atheist is that de Waal analyses today’s moral 
landscape, in particular the schism between 
militant atheism and religion. 
De Waal views extreme strains of atheism 
as getting “all worked up about the absence 
of something’, at one point using the fanci- 


Pe de Waal’s latest book, The Bonobo 


ful device of a talking 

bonobo as his mouth- DNATURE.COM 
piece (hence the book's _ For Christian 
title). His view is that  Keyserson Frans 
religion is undeniably de Waal, see: 

in our bones — even _ go.tiature.com/pllidcq 
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though evidence of pri- 
mate precursors seems 
less than substantial. 
This does not mean 
that he is pro-religion, 
however. The Bonobo 
and the Atheist is per- 


meated with the ethos 
of secular humanism, y 
using the Renaissance The Bonobo ; 
painter Hieronymus Heim as 
Bosch’s The Garden of Humanism among 


Earthly Delights — a 
vision of humanity 
freed from narrow 
moral constraints — 
as a touchstone for his 
arguments. 

In his discussion of empathy and moral- 
ity, de Waal has little time for what he calls 
“veneer theories” that reduce altruism to ‘nat- 
ural selfishness. As he shows, human altru- 
ism has analogues in a wide range of species, 
even though sterile ants’ care for the offspring 
of their queen can hardly be labelled empathy. 
When dolphins assist humans struggling in 
the water, we may at least suggest some basic 
similarities. But when a chimpanzee, sharing 
more than 95% of our DNA, helps an unre- 
lated member of its group to lick a wound it 
cannot reach, a type of empathy very near the 


the Primates 
FRANS DE WAAL 

W. W. Norton: 2013. 
320 pp. $27.95, 
£18.99 
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human is surely coming into play. 

Many evolutionists favour chimpanzees 
as ancestral models. Whereas de Waal does 
look frequently to chimpanzees as exem- 
plars of primate altruism, he champions the 
less violent bonobo — not least because its 
habitat, like that of our common primate 
ancestor, remains the tropical forest, whereas 
chimpanzees and humans have evolved into 
ecological generalists. 

De Waal looks to mothering and infant 
care by non-kin, a basic form of empathy dis- 
cussed by primatologist Sarah Hrdy in Moth- 
ers and Others (Harvard University Press, 
2011), as the foundation of human altruism 
and complex cooperation, and as his prime 
evolutionary building block for morality. He 
also emphasizes the importance of emotion 
in moral choices, citing the work of psycholo- 
gist Jonathan Haidt, author of The Righteous 
Mind (Allen Lane, 2012). Haidt’s empirical 
investigations of subjects’ disgusted reactions 
to incest demonstrate that when it comes to 
morality, raw emotions trump rationality. 

The Bonobo and the Atheist does leave 
a gap of sorts in explaining moral origins. 
In discussing his building blocks, de Waal 
seems relatively unconcerned with the actual 
mechanisms of natural selection that keep 
them in place, be this through kin selection, 
mutualism, reciprocal altruism, group selec- 
tion or social selection. 

These theories can help to explain why nice 
guys don't necessarily finish last. Group selec- 
tion theory may be making a comeback, as 
seen in a 2010 paper in Nature by E. O. Wil- 
son and others; there are also social-selection 
arguments in Richard Alexander's The Biology 
of Moral Systems (Aldine Transaction, 1987), 
where he suggests that people's altruistic repu- 
tations may seriously enhance reproductive 
success. In my own Moral Origins (Basic 
Books, 2013) I suggest that social selection in 
the form of groups punishing moral deviants 
has profoundly shaped human gene pools, 
and given us a conscience — an argument de 
Waal does discuss. 

De Waal’s style is refined yet lively; his 
humanism both highbrow and lowbrow. 
One minute he is referencing the art of 
Bosch, the next comparing himself to an 
Australian toilet frog, showered over the 
decades by distastefully superficial socio- 
biological explanations of human altruism. 
The Bonobo and the Atheist is a synthesis on 
all levels, masterfully marshalling ethology, 
psychology, philosophy and anthropology 
in its drive to understand ourselves through 
the lens of other primates. m 


Christopher Boehm is professor of 
biological sciences and anthropology and 
director of the Goodall Research Center at 
the University of Southern California, Los 
Angeles, USA. 
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EVOLUTIONARY BIOLOGY 


Gentle giant of genetics 


Oren Harman assesses the first biography of biologist 
W. D. Hamilton, the ‘greatest Darwinian since Darwin’. 


e was socially awkward, charmingly 
Hisstersosne honest, modest, a 

champion of underdogs. A romantic 
who saw the human condition as essentially 
tragic, he felt more at home in the Amazon 
bush than with people, judging culture “a 
braggart”; the genes, to his mind, really ran 
the show. And his theory of inclusive fitness, 
many hold, is the single most original devel- 
opmentin evolutionary theory since Darwin, 
explaining the workings of natural selection 
through the ‘gene’s eye’ point of view. 

The British evolutionary biologist 
W. D. “Bill” Hamilton (1936-2000) was a 
naturalist of legendary knowledge, especially 
of insects, and a complex man, both rugged 
and compassionate. An uncompromising 
truth-seeker, he set the highest of stand- 
ards for his scientific papers, which were 
often rejected, famously by Nature. Now, in 
Nature’s Oracle, sociologist Ullica Segerstrale 
presents the first biography of this gentle 
giant with a poet's soul. 

Hamilton asked big questions and provided 
answers just as dramatic. Where does altru- 
ism come from? Why does sex exist? Why 
are females attracted to males? How does 
cooperation in nature arise? Marshalling 
mathematics, and later computer modelling, 
in the service of an acute power of observa- 
tion and an ability to see the world from the 
point of view of the tiniest biological crea- 
tures and entities — the wasp, the virus, the 
gene — Hamilton reached often counter- 
intuitive solutions. Sex is merely an adapta- 
tion to avoid bugs, with ourselves and other 
diploids as “guilds of genotypes’, as he put it, 
“committed to free, fair exchange of biochem- 
ical technology for parasite exclusion”. Coop- 
eration is a game played by the self-interested, 
nothing more; aesthetics is a signal for good 
genetics. meant to garner mates. 

An intellectual and physical daredevil, 
Hamilton was also often distracted and 
sometimes depressed. Sporting a caveman- 
like shock of thick white hair, he was famous 
for mumbling inaudibly, his back to the audi- 
ence, during virtually incomprehensible lec- 
tures in the United States and Britain. He felt 
alone and uncomprehended. Despite his unu- 
sual generosity in giving credit to others for 
his own fountain-like ideas, Segerstrale also 
uncovers an exaggerated jealousy for primacy, 
and an ‘outsider’ mentality that persisted long 
after he had become celebrated. 

It is not easy to remain unmoved by 
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who left behind a bru- 
tally honest intellectual Nature’s Oracle: 
autobiography, Narrow A Life of W. D. 
Roads of Gene Land, in Hamilton = 

: ULLICA SEGERSTRALE 
two volumes (a third — oygorq University 
followed, edited post- Press: 2013. 441 pp. 
humously). Readers £25, $35 


will be excused for not- 
ing that Segerstrale’s biography seems to rely 
rather heavily on these. “I tend to be more like 
a bird than alligator — hatching my ideas to 
maturity before letting them loose,’ Hamilton 
wrote of himself. Passing by a yellow ragwort 
struggling to grow through a crack in Lon- 
don’s asphalt, he noted, “a screaming child 
would not have touched my heart in the same 
way’. It's a tough charge to improve on that. 
In places, under-reliance on the second- 
ary literature leads to either conjecture (that 
J. B. S. Haldane sought to prove altruism 
compatible with natural selection because 
of his left-leaning political tendencies) or 
factual fuzziness (missing the crucial impact 
of George Price's covariance paper on Ham- 
ilton’s view on group selection). A somewhat 
incomplete treatment of Hamilton's intellec- 
tual legacy also leaves the reader lacking the 


W. D. Hamilton a few years before his death. 
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wider context. In the past few years, the pages 
of Nature have seen a hard-fought debate 
between proponents of Hamiltonian kin 
selection and its challengers, but Segerstrale 
chooses not to delve deeply into the contro- 
versy. Had she done so, she would have been 
able to show the extent to which E. O. Wilson’s 
current views on the evolution of eusociality 
represent a return to Hamilton's ideas. 

Hamilton was indeed an inveterate seeker 
of patterns, a man with “evolution on the 
brain’, as Segerstrale puts it, but he sometimes 
took this too far. His application of extreme 
geneticism to humans is an example. Wary 
of the adverse effects of modern medicine 
on the gene pool, he called unapologeti- 
cally for infanticide of the genetically weak. 
This is one instance in which the otherwise 
singular Hamilton is anything but unique, 
joining a long line of naive champions of 
eugenics — and it demands more detached, 
hard-nosed criticism than offered here. 

Still, combing through letters and scru- 
pulously interviewing family members, 
colleagues and friends, Segerstrale succeeds 
in bringing Bill Hamilton, warts and all, to 
what will be an appreciative audience. “Iam 
ashamed that in practice I find myself much 
more selfish than my ideal,” the man who 
tackled altruism confessed touchingly in a 
long, paranoid letter to John Maynard Smith 
in 1977, accusing Smith of not giving him 
sufficient credit for the idea of inclusive fit- 
ness. Hamilton’s mother, a medical doctor, 
saw signs of Asperger's syndrome (an autism 
spectrum disorder) in characteristics such 
as his obsession with priority, preoccupa- 
tion with patterns, minuscule handwriting 
and focus on number and detail. He himself 
thought, tellingly, that being a ‘thing’ rather 
than a ‘people, person accounted for his 
insight into nature. 

Whatever the truth in these suppositions, 
Segerstrale’s biography of Hamilton is ulti- 
mately a celebration of the life in science of 
a deeply feeling human being — an “evolu- 
tion sufferer” as he called himself. In the 
grand scheme of nature, Hamilton knew, a 
human life is desperately transient. So did 
his favourite poet, A. E. Houseman: 


For nature, heartless, witless nature, 

Will neither care nor know 

What stranger’ feet may find the meadow 
And trespass there and go, 

Nor ask amid the dews of morning 

If they are mine or no. 


Despite it all, Bill Hamilton’s life is one to 
remember. 


Oren Harman is professor of the history of 
science at Bar-Ilan University, Israel, and 
author of a biography of George Price, The 
Price of Altruism (W. W. Norton, 2010). 
oren.harman@gmail.com 
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Medical supplies in 
Iran hit by sanctions 


The delivery and manufacture 
of vital medical supplies in Iran 
is being impeded by bilateral 
sanctions on the country’s 
economy, imposed to force Iran 
to comply with international 
rules on nuclear programmes. 
This situation is causing a 
humanitarian disaster and 
contravenes the Universal 
Declaration of Human Rights. 

Medicines are not themselves 
subject to sanctions, but the 
licensing, purchase and shipment 
of imported medicines have all 
been hit. This, combined with 
the potential threat of sanctions 
by international pharmaceutical 
companies and banks, has led to 
a shortage of specific drugs and 
medical facilities in Iran over 
the past few months, increasing 
drug prices by 50% and affecting 
more than 6 million chronically 
ill patients. 

Experience in Iraq, Cuba, 
Libya and the former Yugoslavia 
indicates that sanctions seldom 
meet their political objectives. 
But they have caused large-scale 
humanitarian disasters in those 
countries (see, for example, 

M. M. AliandL. H. Shah Lancet 
355, 1851-1857; 2000). 

The Iranian authorities 
must take immediate action to 
block the effect of sanctions on 
the supply of medicines and 
of materials for hospitals and 
laboratories. 

The international community 
needs to establish criteria for 
guaranteeing that medical 
products will always be exempt 
from sanctions. 

Ali Gorji Epilepsy Research 
Centre, University of. Miinster, 
Miinster, Germany. 
gorjial@uni-muenster.de 


Fisheries: broaden 
the arguments 


The split over whether catch is a 
useful indicator of fish abundance 
(D. Pauly, R. Hilborn and T. A. 
Branch Nature 494, 303-306; 


2013) is a false dichotomy. Not 
because a realistic position lies 
somewhere in between, but 
because the focus is too narrow. 

In the whole-ecosystem 
approach to fisheries 
management, data on all 
species, including by-catch 
species, must be considered. 
A broad range of information 
is therefore used to assess the 
health of fisheries worldwide, 
from no-data situations that 
rely on precautionary and risk- 
based approaches, to data-poor 
fisheries in which catch data are 
unreliable, as well as data-rich 
fisheries that are well supported 
by catch data and fishery- 
independent surveys. 

As climate change alters the 
distribution and abundance 
of different species, catch data 
may become less useful and 
new methods will be needed for 
assessing fish stocks. However, 
even the best estimates of fish 
numbers will not improve 
fisheries’ health without good 
leadership, robust social capital, 
and effective conservation and 
stock-rebuilding policies. 
Alistair J. Hobday CSIRO 
Marine and Atmospheric 
Research, Hobart, Tasmania, 
Australia. 
alistair.hobday@csiro.au 
Rodrigo H. Bustamante, Eva 
E. Plaganyi CSIRO Marine and 
Atmospheric Research, Brisbane, 
Queensland, Australia. 


Fisheries: manage 
declines 


Catch data are first and 
foremost measures of fishery 
output (D. Pauly, R. Hilborn 
and T. A. Branch Nature 494, 
303-306; 2013). A long- 

term decline in catch can be 

a warning that conditions 

have changed, either in the 
abundance and productivity of 
fish populations or in fishing 
practices. Although increases 
in fishing effort can sometimes 
temporarily restore yields, it 

is important to investigate the 
causes of declines. 
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When time series of 
commercial catch data are the 
only information available, there 
are ways to supplement them with 
life-history information (such as 
maximum body length) to expose 
important changes in population 
abundance (see C. Costello et al. 
Science 338, 517-520; 2012). 

In areas of the world that 
cannot afford investigations of 
abundance, a reduction in fishing 
intensity is prudent precautionary 
management (see Code of 
Conduct for Responsible Fisheries 
UN Food and Agriculture 
Organization; 1995). 

Brian R. MacKenzie, Mark 
R. Payne National Institute for 
Aquatic Resources, Technical 
University of Denmark, 
Charlottenlund, Denmark. 
brm@aqua.dtu.dk 


Australia’s grant 
system wastes time 


We found that scientists in 
Australia spent more than 

five centuries worth of time 
preparing research-grant 
proposals for consideration by 
the largest funding scheme of 
2012. Because just 20.5% of these 
applications were successful, the 
equivalent of some four centuries 
of effort returned no immediate 
benefit to researchers and wasted 
valuable research time. The 
system needs reforming and 
alternative funding processes 
should be investigated. 

We surveyed a representative 
sample of Australian researchers 
and found that preparing new 
proposals for the National 
Health and Medical Research 
Council's project grants took 
an average of 38 working days; 
resubmitted ones took 28 days 
on average. Extrapolating 
this to all 3,727 submitted 
proposals gives an estimated 550 
working years of researchers’ 
time (95% confidence interval, 
513-589), equivalent to a 
combined annual salary cost of 
Aus$66 million (US$68 million). 
This exceeds the total salary 
bill (Aus$61.6 million) at 
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Melbourne's Walter and Eliza 
Hall Institute, a major medical- 
research centre that produced 
284 publications last year. 

The grant proposals we 
analysed were typically 80-120 
pages long. If these were more 
focused, it would reduce 
preparation costs and could 
improve the quality of peer 
review by reducing workloads. 
Danielle L. Herbert, Adrian 
G. Barnett, Nicholas Graves 
Queensland University of 
Technology, Brisbane, Australia. 
n.graves@qut.edu.au 


Outdated taxonomy 
blocks conservation 


We agree with Frank 

Zachos and colleagues that 
splitting mammalian species 
taxonomically without scientific 
justification could impede their 
conservation (Nature 494, 35; 
2013). But so does uncritically 
lumping them together. 

Mammalogy is beleaguered 
by a dogmatic regard for mid- 
twentieth-century propositions, 
which were seldom based 
on critical study and lacked 
phylogenetic information. 
Species were lumped together 
and incorporated into influential 
classification checklists to 
simplify regional faunas and 
make them more manageable for 
non-taxonomists. 

Modern integrative 
approaches have shown that 
this tactic has hidden an 
incommensurable number 
of distinctive species 
from conservation efforts 
(W.R. Morrison et al. Biol. 
Conserv. 142, 3201-3206; 2009), 
thereby increasing the risk of 
extinctions. 

Scepticism should be accorded 
to any taxonomy that is not based 
on comprehensive revisionary 
work, phylogenetic studies or, 
ideally, both. 

Eliécer E. Gutiérrez, Kristofer 
M. Helgen National Museum 

of Natural History, Smithsonian 
Institution, Washington DC, USA. 
gutierreze@si.edu 
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3D without the glasses 


A glasses-free three-dimensional display technology has been invented that may be an exciting alternative to current 
solutions for mobile devices. It makes use of an optical effect known to school physics students. SEE LETTER P.348 


NEIL A. DODGSON 


e are all familiar with the glasses 
needed to watch three-dimen- 
sional (3D) movies. But glasses are 


inappropriate for many possible uses of a 3D 
display. There are a range of technologies 
that can achieve 3D without the glasses. On 
page 348 of this issue, Fattal et al.' describe a 
new form of illumination device that could 
help to achieve it for mobile devices. 

Humans see the world stereoscopically. 
That is, our two eyes see two slightly different 
views of the world because they are separated 
by about 63 millimetres”. Our brains combine 
these two views into an internal 3D model 
of whatever we are seeing. Conventional 2D 
displays provide only a single flat image, so 
the two eyes both see the same picture on the 
screen. A 3D display device must instead pre- 
sent a different image to each eye. 

Glasses-based 3D systems can work in 
several ways. One way, known as active shut- 
tering, uses rapidly flickering shutters in the 
glasses, synchronizing them with rapid alter- 
nation of left and right images on the screen to 
give a different image to each eye. This is how 
most home 3D televisions work. A cheaper 
solution is passive polarization’, in which the 
two lenses in the glasses are polarized in dif- 
ferent directions, with the display showing two 
images simultaneously, one with each polariza- 
tion. This is how most 3D cinema has worked 
since the 1950s. And then there is the cheap 
and cheerful anaglyph solution: the familiar 
red-green glasses*, where each eye's image is 
produced using filters of different colours (red 
and green). Anaglyph was used for the earli- 
est experiments in 3D movies, in 1922, but 
was quickly found to cause headaches, hence 
the migration to polarized systems. However, 
anaglyph is still used as a cheap alternative, 
especially in print media. 

Consider now how you could achieve 3D 
without using any special glasses. You would 
need to send a different image to each eye, but 
the mechanism for doing this now needs to 
be in the display itself. The individual pixels 
in the display therefore need directional con- 
trol. Conventional 2D displays have pixels that 
send light in all directions. A 3D display needs 
pixels that send light in carefully constrained 
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Figure 1 | Autostereoscopic effects. a—c, Three 
examples of a 3D imaging display in which the 3D 
effect is encoded in the display and can be viewed 
with no need for glasses. a, Only two viewing 
zones, one for each eye. A viewer with their nose 
in the right place sees 3D. Anyone else sees just 
2D: the same image to each eye. b, Only two 
views, but the zones repeat. A viewer in the right 
place sees 3D. A viewer in the wrong place sees 
pseudoscopically: the wrong image to each eye. 

c, Sixteen views. Everyone sees a 3D image. This is 
the effect of multiview autostereoscopic displays 
such as that proposed by Fattal and colleagues’. 


directions, so that different light reaches each 
eye and the viewer’s two eyes thus see different 
images on the screen. You might assume that 
just two constrained directions are needed: 
one for each eye (Fig. 1a,b). But this is not the 
case, because the display does not know where 
your eyes are and therefore it has to provide 
imagery across the entire viewing space, to give 
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the correct 3D effect no matter where the eyes 
are placed. 

The ideal solution is to have pixels that are 
able to display different colours in every possi- 
ble direction. This can be achieved using holo- 
graphic techniques but, currently, holography 
is practical only for still images. For updatable 
displays, or 3D video, we need the next best 
thing to holography: autostereoscopic multi- 
view 3D display. That is, a stereoscopic display 
(a different picture to each eye) that works 
automatically (without requiring the user to 
wear any special device) and displays multiple 
different images on one display screen, each 
visible from particular places in front of the 
screen. Figure 1c shows such a display with 
16 different viewing zones’. In each zone, a 
different image is visible on the screen. Every 
viewer sees in 3D (a different picture to each 
eye) and each viewer sees 3D from their par- 
ticular position (different pictures as you move 
your head from side to side). 

There are several ways to achieve this for 
large-scale displays: optical elements such 
as lenticular arrays® or parallax barriers’ in 
front of a liquid-crystal-display panel, multi- 
ple projectors behind a large lens system’, or 
rapid display of multiple images combined 
with complex optical switching’. Lenticular 
arrays are a relatively successful mechanism for 
desktop 3D. They have the advantage that their 
constituent lenticules (tiny abutting cylindri- 
cal lenses) comprise a thin sheet in front ofa 
liquid-crystal display, making them cheap to 
manufacture. The disadvantage of lenticular 
arrays, shared with parallax barriers, is that 
they reduce the native resolution of the under- 
lying display by a factor equal to the number of 
distinct viewing directions. 

The most familiar glasses-free mobile 
display is the Nintendo 3DS, which uses a 
parallax-barrier system to create two viewing 
zones. With only two views, the user has to be 
roughly the right distance from the screen and 
must ensure that their nose is on the boundary 
between the two zones, so that each eye sees the 
appropriate view. The optics of these devices 
mean that the viewing zones are repeated as the 
head is moved left-right (Fig. 1b). Lenticular 
arrays, producing the same kind of effect, have 
also been used for other mobile devices. 

Fattal et al. offer a different approach to 


producing an autostereoscopic multiview dis- 
play. They use alight guide combined with dif- 
fraction gratings (see Fig. 1a,b of the paper’) to 
direct light in specific directions. Their dem- 
onstration models can send light in 14 distinct 
viewing directions, and they believe that the 
design can be made to produce up to 64. The 
implication is that their devices could be used 
to produce an autostereoscopic display with up 
to 64 distinct viewing zones. With that many 
zones, the inter-zone spacing could be con- 
siderably smaller than the distance between 
the eyes, producing a smooth 3D effect that 
would be similar to viewing a white-light 
hologram. 

However, there are considerable challenges 
to overcome before we will see hologram-like 
displays on a mobile device. First, we need to 
make sure that 3D will not compromise spatial 
resolution. Most current mobile devices have 
pixels about 250 micrometres across. The new 
illumination system has individual direction 


STEM CELLS 


pixels 36 micrometres across, theoretically 
allowing us to pack up to 48 directions into 
the space of a single pixel. This device thus 
seems to give us the desired directional con- 
trol with no loss of spatial resolution. Next, the 
3D effect must not compromise image qual- 
ity. The example images in the paper show 
that there is considerable work to be done to 
improve quality to an acceptable level. Then, 
the display must have the 64 images to display. 
This is straightforward, although computa- 
tionally expensive, with computer-generated 
imagery but is a significant challenge for 
live action. 

Finally, the new illumination system must be 
manufactured reliably, robustly and in quan- 
tity. This is a matter of careful engineering 
that can take a long time. Iam reminded of the 
nine-year development of Texas Instruments’ 
digital micro-mirror device, now widely used 
in digital projectors, between the invention in 
1987 and the first commercial product in 1996. 


Painkillers caught in 
blood-cell trafficking 


Haematopoietic stem and progenitor cells move from the bone marrow into the 
circulation to replenish normal blood-cell levels. Inhibiting a prostaglandin- 
mediated signalling pathway may promote this process. SEE LETTER P.365 


JASON M. BUTLER & SHAHIN RAFII 


system reside in the safe haven of their 

immediate surroundings — a micro- 
environment that contains a complex net- 
work of ‘niche’ cells. Interaction with niche 
cells prevents unauthorized egress of these 
haematopoietic stem and progenitor cells into 
the circulation, whereas their mature counter- 
parts are readily launched there. How the 
various niche cells decide to relocate some cells 
but not others, to meet the demand for blood 
cells under normal and stressful conditions, is 
not fully understood. On page 365 of this issue, 
Hoggatt et al.’ provide one notable clue*. 

The authors show that interfering with 
the interaction between the messenger lipid 
molecule prostaglandin E, (PGE,) and its 
G-protein-coupled receptor EP4 stimulates the 
mobilization of haematopoietic stem cells and 
haematopoietic progenitor cells (HSCs/HPCs) 
from the bone marrow to the circulation. This 
effect was evident following deletion of the EP4 
gene in mice; using EP4 antagonists; and after 
lowering PGE, production by treating mice 


S™ and progenitor cells of the blood 


*This article and the paper under discussion? were 
published online on 13 March 2013. 


with non-steroidal anti-inflammatory drugs 
(NSAIDs), including aspirin, ibuprofen and 
meloxicam. 

Individually, NSAIDs and EP4 antagonists 
were marginally effective in forcing HSC/ 
HPC egress, but their effect was consider- 
ably improved when they were used together 
with other mobilizing agents — G-CSF and 
a CXCR4 antagonist. Most importantly, 
the authors report such enhanced effects of 
NSAIDs, alone or in combination with G-CSF 
in non-human primates and human subjects. 
This sets the stage for clinical studies to inter- 
rogate the potential of PGE, for optimizing 
HSC/HPC mobilization protocols as part of 
therapies involving myeloablation, which 
involves the destruction of diseased blood cells 
followed by induced production of new blood 
cells from the stem-cell pool. 

The mechanism by which PGE,-EP4 signal- 
ling regulates blood-cell trafficking is complex 
and involves balancing HSC/HPC mobilization 
both directly, through cell-specific signalling, 
and indirectly, by activating stromal cells of 
the niche. Together with earlier work”’ show- 
ing that, by contrast, short-term exposure 
of haematopoietic cells to PGE, increases 
their migration and engraftment, these 
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By analogy, if the authors can solve the prac- 
tical problems, then they have a compelling 
alternative to existing 3D display technology. 
All that remains is the more nebulous question 
of whether humans want or need 3D displays. 
Time will tell. m 


Neil A. Dodgson is at the Computer 
Laboratory, University of Cambridge, 
Cambridge CB3 OFD, UK. 

e-mail: neil.dodgson@cl.cam.ac.uk 
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findings qualify the PGE,-EP4 signalling 
hub as a gatekeeper of the haematopoietic 
microenvironment (Fig. 1). 

Uncovering the molecular pathways 
involved in the efficient mobilization and 
engraftment of HSCs/HPCs has major impli- 
cations for the treatment of various cancers 
with myeloablative therapies*’. High blood 
levels of a range of factors — including G-CSF, 
CXCR4 antagonists, numerous CC and CXC 
chemokines’, VEGF-A and PIGF (ref. 5) — 
were known to stimulate the mobilization of 
HSCs/HPCs localized to specific haemato- 
poietic niches. However, the identity of 
factors that could modulate both mobilization 
and engraftment of these cells were not known. 

PGE, is one such factor, and, by interact- 
ing with EP4, it regulates bidirectional traf- 
ficking of HSCs/HPCs. Indeed, the effects 
of PGE, could be biphasic. Brief stimulation 
of HSCs/HPCs with this molecule augments 
engraftment and reconstitution of blood cells 
by accelerating haematopoiesis®. This function 
of PGE, is mediated by activation of the Wnt 
signalling pathway to control proliferation and 
programmed death of HSCs/HPCs’. Non- 
stop stimulation, however, may inhibit HPC 
expansion and recovery of haematopoiesis®, as 
well as prevent the trafficking of HSCs/HPCs, 
partly by CXCR4 downregulation. Therefore, 
capitalizing on the potential of the PGE,-EP4 
signalling pathway to augment haematopoiesis 
may require a two-pronged approach in which 
EP4 is initially activated to trigger migration 
and engraftment of HSCs/HPCs, and then 
inhibited to encourage HPC expansion and 
haematopoietic reconstitution. 

An intriguing question is exactly how 
NSAIDs modulate trafficking of the HSCs/ 
HPCs within their microenvironment. EP4 is 
expressed by bone cells called osteoclasts and 
osteoblasts; by endothelial cells, which line the 
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Figure 1 | NSAIDs are gatekeepers of the haematopoietic microenvironment. a, Prostaglandin E, 
(PGE,) can bind to its receptor EP4 on the surface of haematopoietic stem and progenitor cells 
(HSCs/HPCs), as well as on the surface of niche cells within the HSC/HPC microenvironment, which 
include perivascular stromal cells, endothelial cells that line blood vessels and cells of the bone lineage. 
The signalling cascade initiated by PGE,-EP4 interaction inhibits mobilization of HSCs/HPCs into the 
circulation. b, Hoggatt et al.’ show that NSAIDs, by inhibiting PGE,-EP4 interactions and in part by 
thinning of the bone cells, can synergize with the mobilizing agents G-CSF and CXCR4 antagonists. 
This promotes egress of HSCs/HPCs into the circulation by weakening their tethering to the niche cells. 


blood vessels; by subsets of haematopoietic 
cells; and most probably by perivascular 
cells, which occur in peripheral blood ves- 
sels. Hoggatt et al. show that treatment with 
NSAIDs results in thinning and loss of the cells 
of the bone lineage, diminishing their capacity 
to retain HSCs/HPCs within the bone marrow. 
Moreover, they find that NSAIDs decrease 
bone-derived osteopontin, a molecule that 
regulates mobilization of haematopoietic cells. 
These painkillers might therefore primarily 
target bone cells to decrease the tethering of 
HSCs to stromal cells. 

Nonetheless, PGE, might also induce 
mobilization by directly affecting perivascu- 
lar and endothelial cells. Notably, sinusoidal 
endothelial cells line small blood vessels called 
sinusoids. The sinusoids, which form branch- 
ing structures in much of the bone-marrow 
vasculature, are not mere passive conduits for 
delivering oxygen and nutrients, but rather 
represent specialized blood vessels that form 
an instructive ‘vascular niche’ in the haemato- 
poietic organs. Normally, activated endothelial 
cells produce defined blood-related growth 
factors (angiocrine factors) to balance the 
self-renewal and differentiation of HSCs”. 
One angiocrine factor is PGE,, through which 
endothelial cells could modulate HSC/HPC 
trafficking. Indeed, during recovery from 
myeloablative therapy, the induction of 
angiocrine factors — including PGE, — in 
endothelial cells regulates trafficking and 
regeneration of HSC/HPCs’*"“*. Unravelling 
the mechanism by which PGE, polices the 
trafficking of HSCs/HPCs within the complex 
network of intertwined niche cells will require 
the deletion of EP4 in specific niche cells, 
including endothelial cells, perivascular 
cells and cells of the bone lineage. 

Notwithstanding the potential benefits 
of NSAIDs, these drugs may act as a 


double-edged sword during haematopoietic 
recovery. Because activation of PGE,-EP4 
signalling promotes blood-vessel formation, 
NSAIDs might interfere with the regeneration 
of endothelial cells, thus impairing haemat- 
opoietic reconstitution. These complex issues 
could also be addressed by differentially delet- 
ing EP4 in various bone-marrow stromal 
niche cells, specifically endothelial cells dur- 
ing recovery from myeloablation. In addition, 
selective deletion of prostaglandin-E synthase 
enzymes in various niche cells will define the 
specific niche cells that functionally deploy 
PGE, to activate HSCs/HPCs. 

The present findings are relevant to whether 
NSAIDs can restore the mobilization of 


HSCs/HPCs in patients who are resistant to 
treatment with G-CSF and CXCR4 antago- 
nists. It is plausible that resistance to these fac- 
tors is not due to intrinsic HSC/HPC defects, 
but rather reflects impaired instructive func- 
tions of haematopoietic niche cells. Thus, 
NSAIDs could pave the way for defining alter- 
native routes to safely mobilize HSCs/HPCs 
through manipulation of haematopoietic 
niche cells. These observations also shed light 
on the mechanism by which NSAIDs may con- 
tribute to protection against cardiovascular 
disorders, by mobilizing haemangiocytes — 
HPCs that promote organ-specific formation 
of functional blood vessels’”. = 
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Mainly in the plain 


The finding that global mass loss from landscapes is dominated by physical 
erosion and chemical weathering from flat terrain, rather than from mountains, 
challenges our understanding of how Earth’s surface evolves. 


JAMES W. KIRCHNER & KEN L. FERRIER 


viduals, we humans last only some decades 

or so. Our most Ozymandian creations 
crumble into oblivion over hundreds or thou- 
sands of years. And on million-year timescales, 
even the landscapes on which we build our 
civilizations will disappear, gradually con- 
sumed by physical erosion and chemical disso- 
lution. How quickly these processes transform 
Earth’s surface has been a question of great 
interest to geologists and geochemists. 


[eens everything falls apart. As indi- 
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But they may have mostly been looking for 
the answer in the wrong places, Willenbring 
et al.' report in Geology. Geologists love moun- 
tains, and have focused their attention on 
steep terrain, where rapid erosion is thought 
to drive rapid chemical weathering”. However, 
Willenbring et al. say that, on a global basis, 
there is just not enough mountainous terrain 
to account for much of the global flux of sedi- 
ment (from erosion) or solutes (from chemical 
weathering) reaching the oceans. 

This argument is likely to arouse interest, 
and some measure of controversy, because 


it concerns not only the evolution of Earth’s 
surface, but also its long-term climate. When 
calcium and magnesium are dissolved from 
silicate minerals and later deposited as car- 
bonates on the ocean floor, carbon dioxide is 
removed from the ocean-atmosphere system. 
This mechanism may stabilize global climate 
over the long term if the climatic shifts that 
accompany higher atmospheric CO, levels 
also accelerate global erosion and weathering 
fluxes. 

Willenbring and colleagues’ analysis is 
based on a global compilation of what geolo- 
gists term denudation rates: these are rates of 
mass loss, or of equivalent surface lowering, 
from landscapes as a result of physical erosion 
and chemical weathering. Long-term denu- 
dation rates can be inferred from cosmogenic 
nuclides — rare isotopes that are produced 
inside mineral grains by cosmic-ray bom- 
bardment’. Measuring the concentrations of 
these nuclides is technically demanding, but 
the effort is worthwhile because they intrin- 
sically reflect average denudation rates over 
thousands or tens of thousands of years. And 
when they are measured in handfuls of river 
sand, they reflect the average denudation rate 
of the entire contributing drainage basin. In 
the nearly 20 years since this technique was 
proven to work’, more than 1,000 such meas- 
urements of basin-scale denudation rates have 
been published. 

Using their global compilation of cosmo- 
genic-nuclide measurements, Willenbring 
et al. show that roughly half of the worldwide 
variation in denudation rates can be explained 
by a simple exponential function of average 
basin slope. This implies that other factors 
such as climate, rock type and vegetation must 
be of secondary importance. Another recent 
compilation of cosmogenic-nuclide data’ is 
consistent with this exponential relationship, 
as are other, smaller data sets covering differ- 
ent spatial scales, timescales and measurement 
methods*®”. This exponential relationship 
implies that denudation rates do not decline 
to zero as slope approaches zero; instead, 
they converge to a constant value of roughly 
0.01-0.1 millimetres per year. 

These rates are slower than those in steep 
mountainous terrain by an order of magni- 
tude or more, but gentle landscapes are so 
abundant that they dominate the global denu- 
dation budget. Willenbring and co-workers 
calculate that landscapes with slopes of less 
than 100 metres per kilometre account for 
more than nine-tenths of Earth’s land surface 
and eight-tenths of the global denudation flux, 
even when the faster rates in steeper terrain 
are taken into account. The implication is that 
understanding how Earth’s topography and 
climate have co-evolved over geological time 
— and how they will co-evolve in the future — 
will require much better data on erosion and 
weathering processes in flat terrain. 

Getting those data will be a challenge. We 


NEWS & VIEWS | RESEARCH | 


Figure 1 | Lowland flat, mountain high. Flat terrain surrounds towering peaks in the eastern Tibetan 
Plateau. Willenbring and colleagues’ analysis' suggests that geologists need to focus on gently sloping terrain, 
rather than mountains, if we are to understand the denudation processes that transform Earth’s surface. 


currently have far fewer measurements of long- 
term denudation rates in gentle terrain than 
in steep terrain®. Not one of the nearly 1,000 
denudation-rate measurements in Willenbring 
and colleagues’ database comes from a land- 
scape with a slope of less than 10m km”’, even 
though such landscapes account for roughly 
half of Earth’s land surface’. 

There are details in the paper that one 
can argue over. For example, measurements 
of topographic slope are scale-dependent; 
widely spaced elevation measurements strad- 
dle valleys and ridges and thus underestimate 
topographic gradients, potentially by a lot. 
Therefore, Earth may not be nearly as flat as 
one would infer from the global topographic 
data set used by the authors, which has a spatial 
resolution of only 1 km. 

More fundamentally, a lot of gentle terrain 
consists of depositional landforms such as 
basins, deltas, fans and playas, where sediment 
accumulates, rather than erodes, over the long 
term. Cosmogenic nuclides can say nothing 
useful about the denudation rates of deposi- 
tional landscapes (which are, strictly speaking, 
negative). Therefore, cosmogenic nuclides are 
typically measured only in the subset of gentle 
landscapes that are actively eroding, and not 
in those that accumulate sediment. This will 
lead to a systematic upward bias in the inferred 
average denudation rate of gentle terrain, and 
so to an upward bias in the importance of 
gentle terrain in Willenbring and colleagues’ 
global denudation budget. The magnitude of 
this bias is unknown. 

Nonetheless, the authors’ analysis does 
suggest that more geologists should climb 
down from the mountains that they love so 
much, to explore the erosion and weathering 
of the surrounding lowlands (Fig. 1). When 
they get there, however, they will find that 
many of these landscapes have been utterly 


transformed by human activities, which have 
greatly accelerated soil-erosion rates — to the 
point that they rival or exceed natural erosion 
rates in some of the steepest mountains on 
Earth, and threaten the long-term future of 
our food supply’. 

This is not a new problem: the crumbling 
relics of many vanished civilizations bear mute 
witness to the fate of those who squander their 
soil resources”. What is new, this time around, 
is that we know we are destroying our soil, 
and we know how to stop. The scientific basis 
for soil conservation has been understood 
for decades. What remains to be discovered, 
in many cultural landscapes, is the will to 
implement it. = 
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Trouble upstream 


Mutations in the promoter sequence of the gene encoding a component of the 
enzyme telomerase have been discovered in patients with melanoma, suggesting 
that mutations in this regulatory region can drive tumour development. 


E. ELIZABETH PATTON & LEA HARRINGTON 


ne of the hallmarks of cancer is the 
() increased production of telomerase, 

an enzyme that replenishes chromo- 
some ends and enables continued cell division. 
But how and when does production of telo- 
merase become selected for in the evolution of 
a tumour? Writing in Science, Huang et al.' and 
Horn et al.’ show that subtle genetic changes 
in the promoter region that lies upstream of 
the gene encoding the telomerase subunit 
TERT (telomerase reverse transcriptase) are 
driver mutations in melanoma, and possibly in 
other cancers. 

The two groups arrived at this discovery 
from different perspectives. Huang and col- 
leagues analysed whole-genome sequence data 
for cells taken from human malignant mela- 
nomas and focused on the DNA that was not 
part of protein-coding sequences. In more than 
70% of these melanoma samples, they found 
one of two single-nucleotide C-to-T mutations 


UV irradiation 


Epidermal 
cell 


Melanin 


ora dinucleotide CC-to-TT substitution in the 
core promoter region of TERT (Fig. 1). All three 
mutations are suggestive of damage induced by 
ultraviolet (UV) light; UV light is the primary 
environmental risk factor for melanoma, and 
an abundance of such ‘UV signature’ mutations 
is known in this skin cancer**. The authors also 
identified these TERT-promoter mutations at 
a frequency of about 16% in a panel of cell lines 
derived from a range of other cancers, with a 
somewhat higher than average incidence in 
bladder and liver cancers. 

Horn and colleagues began their study by 
examining the genetics of a large family to 
find a germline (inherited) mutation that pre- 
disposed four generations of family members 
to melanoma. Using gene linkage analysis 
followed by targeted sequencing, they identi- 
fied multiple sequence variants linked to the 
disease, one of which was a T-to-G mutation 
in the TERT promoter. The researchers then 
screened melanoma cells from unrelated 
patients for other examples of this mutation, 


Figure 1 | Promoter mutations create new transcription-factor binding sites. Cells called melanocytes 
reside in the epidermal layer of the skin and provide surrounding skin cells with the pigment melanin to 
protect them from the damaging effects of ultraviolet (UV) light. Huang et al.' and Horn et al. identify 
‘UV signature’ mutations in the promoter region of the gene encoding TERT, a subunit of the enzyme 
telomerase. Strikingly, four mutations identified by the authors all result in nucleotide sequences that 
contain a binding site (red) for transcription factors of the ETS family. Binding of these proteins leads to 
increased TERT expression. Horn et al. also describe a UV-independent germline mutation in the TERT 
promoter in cells from members of a melanoma-prone family that similarly generates a binding site for 


ETS transcription factors. 
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but they instead found a high frequency of 
UV-signature mutations in the TERT pro- 
moter, including those identified by Huang 
et al. and an additional CC-to-TT double 
substitution. 

Until now, BRAF has been the gene most 
frequently identified as mutated in melanomas 
and nevi (commonly known as moles), but 
the combined frequency of TERT mutations 
reported in the present studies is even higher 
than for BRAF. Interestingly, Horn et al. did 
not find the TERT-promoter mutations to be 
acommon feature in naevi, although there is 
insufficient evidence to indicate whether they 
are associated with the progression of benign 
naevi to melanoma. 

Frequent TERT-promoter mutations in 
somatic (non-germline) cells have not been 
identified as driver mutations in cancers 
before, and the genome of melanoma cells has 
a notoriously high somatic mutation rate. So 
it was important for both groups to show a 
functional relevance of the mutations and to 
provide evidence that the mutations did not 
arise by chance. 

A key finding supporting such relevance 
is that all the mutations identified have one 
feature in common: they generate a sequence 
that can be bound by ETS transcription factors 
(Fig. 1), and thus have the potential to con- 
fer additional transcriptional control on the 
TERT promoter. Indeed, this seems to be the 
case — the authors observed a two- to four- 
fold increase in TERT expression when they 
tested the mutated promoter sequence in a 
gene-expression ‘reporter system, compared 
with the wild-type promoter. Further experi- 
ments will be required to determine whether 
this increase is also observed in patients. Addi- 
tionally, neither group found the mutations 
together in the same melanoma, implying that 
the mutations are functionally redundant and 
may each confer a selective advantage, rather 
than occurring by chance owing to a locally 
elevated mutation rate. 

Can a subtle change in TERT-gene expres- 
sion give certain cancer cells a growth advan- 
tage? Telomeres are non-coding sequences at 
the ends of chromosomes that shorten with 
every cell division as a result of incomplete 
DNA replication; telomerase acts to replenish 
these sequences. When the amount of func- 
tional telomerase is reduced, even by only 
half, telomere erosion occurs in yeast, mice 
and humans*. In humans and mice, mutations 
in TERT that result in only a single functional 
copy of the gene adversely affect the function 
of tissues that have a high cell turnover, such as 
skin and blood, and can cause early mortality’. 
Conversely, increased telomerase production 
can promote cancer progression; a dramatic 
example of this is the malignant progression 
of prostate cancer that occurs when telomerase 
activity is restored in a telomerase-negative, 
tumour-prone mouse model’. 

Notably, it seems that melanocytes (the 


skin cells in which melanoma arises) are 
especially sensitive to changes in TERT expres- 
sion, because in Horn and colleagues’ study all 
but one person carrying the T-to-G mutation 
developed melanoma, and the unaffected car- 
rier had multiple naevi. This susceptibility 
could arise because ETS proteins are targets 
of the MAPK cell-signalling pathway, which 
is frequently activated in melanoma and 
benign naevi. In support of this idea, Horn and 
colleagues found that the TERT-promoter 
mutations were found in many of the mela- 
nomas that had activating BRAF mutations; 
BRAF is an intermediate in the MAPK path- 
way, and perhaps increased BRAF signalling 
leads to enhanced activation of ETS factors at 
the TERT promoter. 

Another, non-mutually exclusive possibility 
is that tissue-specific expression of ETS 
proteins causes some cell types to become 
particularly vulnerable to the effects of TERT- 
promoter mutations. Indeed, ETS transcrip- 
tion factors are expressed in melanomas’. 
Furthermore, the T-to-G germline mutation 
identified by Horn et al. generates a bind- 
ing site that may be more selective for Elk1 
and Elk4, a subgroup of ETS that is highly 
expressed in the ovary, and the authors report 
that two women with this TERT-promoter 
mutation also developed ovarian cancer. 

These studies reveal two conceptual 
advances. First, they represent an unusual type 
of driver mutation in cancer — one that need 
not modify the gene product itself, but instead 
creates a binding site for transcriptional activa- 
tors that leads to an increase in normal gene 
expression. Second, they show that changes 
in TERT expression may be sufficient to 
cooperate with cancer-initiating oncogenes 
to drive cancers such as melanoma. A crucial 
next experiment will be to examine the func- 
tional consequences of the increase in TERT 
expression in patients with TERT-promoter 
mutations, to see whether it confers a survival 
advantage through a decrease in the incidence 
of short or fused telomeres that often persist 
in dividing cancer cells”’®. Taken together, the 
findings suggest that cancer development can 
be driven by relatively subtle changes in the 
expression level of a normal gene, indicating 
that trouble can lie upstream of genes as well 
as within them. = 
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A frustrated trio 


Geometrical frustration results from an incompatibility between the spatial 
arrangement of a system’s constituents and the interactions between them. 
The effect has now been observed in a triangular triple quantum dot. 


SABINE ANDERGASSEN 


rustration arises when competing 
Piisescion cannot be satisfied simul- 

taneously. This compromise is central 
to the behaviour of many complex systems, 
from neural networks to folding proteins and 
magnetic materials. Writing in Physical Review 
Letters, Seo et al.' describe experiments on a 
simple frustrated system that make a promising 
step towards our understanding of the mecha- 
nisms governing the physics of frustration. 

In condensed-matter physics, frustration 
is a feature of magnetic materials that have 
competing interactions between localized 
magnetic moments (spins). In a ferromagnet, 
the spins align with one another and result in 
a macroscopic magnetic moment. By contrast, 
in an antiferromagnet, neighbouring spins 
point in opposite directions and the system 
exhibits no such macroscopic moment. An 


Gate electrodes 


interesting situation arises when the system's 
lattice geometry inhibits the formation of an 
ordered, low-temperature spin configuration. 
The origin of this effect is remarkably simple, 
and can be illustrated by three spins interacting 
antiferromagnetically on a triangular lattice. 
Once two of the spins on an elementary trian- 
gle are anti-aligned, the third one is ‘frustrated’: 
it can no longer point ina direction opposite to 
both of the other spins. This incompatibility of 
antiferromagnetic interactions with the under- 
lying lattice geometry is known as geometrical 
frustration, and is particularly pronounced 
in low dimensions, in which the effects of 
interactions prevail. 

The simple, and at the same time, funda- 
mental model of geometrical frustration in a 
triangle exhibits six competing lowest-energy 
spin configurations with the same energy 
(degenerate ground states). The resulting 
fluctuations between these states suppress 


Quantum dot 


Figure 1 | Geometrical frustration in a triangle of three quantum dots. Ina triangular triple quantum 
dot in which the dots’ electron spins (arrows) interact antiferromagnetically, if two of the quantum dots 
are in the ‘up’ and ‘down’ spin states, respectively, owing to the triangular geometry of the system, the spin 
of the third quantum dot is frustrated as it can align opposite to only one of the other spins. As a result, 
there are six configurations (all possible combinations of spin-up and spin-down except for the two in 
which all three spins align) with the same energy. By controlling the system with gate electrodes, Seo and 


colleagues’ observe this sixfold degeneracy. 
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conventional ordering and the system features 
non-zero entropy even at zero temperature, 
in contrast to the naive expectation from 
the third law of thermodynamics. Quantum 
superposition of degenerate states leads to 
the emergence of exotic many-body phenom- 
ena’ °, from spin ice and quantum spin liquids 
to high-temperature superconductivity. 

A key characteristic of frustrated systems is 
a large number of degenerate ground states. 
This high degeneracy is very sensitive to per- 
turbations: any slight asymmetry in the system 
will reduce the number of degenerate ground 
states. This instability leads to rich physical 
behaviour. Understanding the fundamental 
mechanisms and principles underlying the 
variety of quantum phenomena that arise from 
highly degenerate ground states is of concep- 
tual importance in condensed-matter physics. 
Both the development of numerical-simula- 
tion techniques and the increasing ability to 
experimentally engineer systems in which 
magnetic, charge or vibrational degrees of free- 
dom interact, have a central role in address- 
ing this problem. In particular, optical lattices 
(arrays of interfering laser beams) of cold 
atoms’ ’ provide model systems for strongly 
interacting many-body systems. 

Quantum dots (artificial atoms) provide 
a promising platform for investigating non- 
equilibrium charge transport, and in particular 
how it is affected by geometrical frustration. 
Advanced techniques for fabricating nanoscale 
devices allow detailed modelling and flexible 
tuning of parameters. In semiconductor quan- 
tum dots, the potential in which the dots are 
confined and the number of electrons, as well 
as the interaction between them, are controlled 
by metallic ‘gate’ electrodes. 

In their study, Seo et al. report the first obser- 
vation of geometrical frustration in quantum 
dots. Because of the difficulties in building 
highly symmetrical devices, this had not been 
achievable previously. Their frustrated quan- 
tum system consists of three quantum dots 
arranged in a triangle (Fig. 1). Antiferromag- 
netic interactions between the dots’ electron 
spins lead to a frustrated ground state, which 
results from the competition of the six degen- 
erate, three-spin configurations out of eight 
possible spin arrangements. Because control- 
ling electron spins in quantum dots is difficult, 
the authors used an alternative way of study- 
ing geometrical frustration: degenerate charge 
states. The advantage of this approach is that 
these charge states (isospins) are defined by 
the parity of the number of charge carriers 
in the dots and can be precisely controlled by 
metallic gate electrodes. The authors’ measure- 
ments of charge transport revealed the six-fold 
degeneracy of the isospin configurations in the 
maximum electrical conductance induced by 
the fluctuations between the states. 

Seo and colleagues’ work sheds light on the 
impact of frustration on charge transport and 
its characteristic signatures, and motivates 


further experimental as well as theoretical 
studies, including analysis of the time evolu- 
tion of frustrated quantum systems. Quan- 
tum dots have been suggested as potential 
platforms for implementing spin or charge 
quantum bits'®* for quantum computation 
and information processing. Control of mag- 
netism on the atomic scale is also becoming 
essential as data-storage devices are miniatur- 
ized. In particular, switchable nanoscale anti- 
ferromagnets are being discussed as candidate 
building blocks for future memory, storage and 
‘spintronic’ applications’. Understanding the 
underlying physics will be crucial for the devel- 
opment of these technologies. The challenge 
is the scalability to large systems — to explore 
complex many-body phenomena, identify 
novel quantum phases and design interesting 
quantum materials. = 


Sabine Andergassen is in the Faculty of 
Physics, University of Vienna, 1090 Vienna, 
Austria. 


MOLECULAR BIOLOGY 


e-mail: sabine.andergassen@univie.ac.at 


1. Seo, M. et al. Phys. Rev. Lett. 110, 046803 
(2013). 

Bramwell, S. T. & Gingras, M. J. P. Science 294, 
1495-1501 (2001). 

3. Ramirez, A. Nature 399, 527-528 (1999). 

4. Balents, L. Nature 464, 199-208 (2010). 
5 
6 


N 


. Pratt, F. et al. Nature 471, 612-616 (2011). 

. Bramwell, S. T. et al. Nature 461, 956-959 

(2009). 

7. Bloch, |. Nature 453, 1016-1022 (2008). 

8. Bloch, |., Dalibard, J. & Zwerger, W. Rev. Mod. Phys. 

80, 885-964 (2008). 

9. Bloch, |., Dalibard, J. & Nascimbéne, S. Nature Phys. 

8, 267-276 (2012). 

O.Loss, D. & DiVincenzo, D. P. Phys. Rev. A 57, 

120-126 (1998). 

.Petta, J. R. et al. Science 309, 2180-2184 
(2005). 

2.Koppens, F. H. L. et a/. Nature 442, 766-771 
(2006). 

3.Hanson, R., Kouwenhoven, L. P, Petta, J. R., 
Tarucha, S. & Vandersypen, L. M. K. Rev. Mod. Phys. 
79, 1217-1265 (2007). 

4.Pioro-Ladriére, M. et al. Nature Phys. 4, 776-779 
(2008). 

5.Loth, S., Baumann, S., Lutz, C. P, Eigler, D. M. & 
Heinrich, A. J. Science 335, 196-199 (2012). 


a 


Circles reshape 
the RNA world 


The versatility of RNA seems limitless. The latest surprise comes from 
circular RNAs, which are found to counteract the function of another class 
of regulatory RNA — the microRNAs. SEE ARTICLE P.333 & LETTER P.384 


KENNETH S. KOSIK 


he protein-coding function of mes- 
senger RNAs can be suppressed by the 
binding of short microRNA sequences. 
But how microRNA-induced suppression is 
itself inhibited is poorly understood. In this 
issue, Memczak et al.' (page 333) and Hansen 
et al.’ (page 384) describe highly stable, circular 
RNAs that bind several copies of a microRNA 
to terminate suppression of mRNA targets”. 
The circular RNA (circRNA) reported, 
called CDRlas by Memczak et al. and ciRS-7 
by Hansen et al., contains roughly 70 evolu- 
tionarily conserved binding sites for micro- 
RNA-7 (miR-7) and forms a complex with 
AGO proteins. The latter are part of the RNA- 
induced silencing complex, which allows 
miRNAs to recognize their target mRNAs. 
When Memczak and colleagues expressed 
human CDR1as/ciRS-7 in zebrafish embryos, 
its effects were the same as those seen when 
miR-7 expression was reduced — impaired 
midbrain development. Moreover, the authors’ 
bioinformatic predictions indicated that 


*This article and the papers under discussion!* were 
published online on 27 February 2013. 
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thousands of circRNAs reside in the genome, 
consistent with previous reports**. 

Target suppression by miRNAs is highly 
nuanced. On the one hand, these sequences 
can induce AGO-mediated endonucleolytic 
mRNA cleavage triggered by complemen- 
tarity between the mRNA and the miRNA 
at nucleotides 10 and 11. The destruction of 
the target, which follows, frees the miRNA to 
bind to its next target in a catalytic manner. On 
the other hand, miRNAs can inhibit protein 
translation by binding more stably to a tar- 
get mRNA in a stoichiometric manner. This 
makes the target a ‘reservoir’ that prevents the 
miRNA from inhibiting other mRNA targets. 
The latter mechanism is an indication of the 
way in which competing endogenous RNAs 
(ceRNAs) act. These are mRNAs that share 
miRNA-response elements (MREs) with other 
mRNAs and so compete for binding to those 
miRNAs with which they also share MREs’. 

Like ceRNAs, circRNAs serve as miRNA 
reservoirs. However, circRNAs have numerous 
binding sites for a specific miRNA and so are 
completely dedicated to their role of harbour- 
ing miRNAs. Binding ofa miRNA toa ceRNA 
not only prevents that miRNA from binding to 
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mRNA targets 


' 


ceRNAs, 
pseudogene decoys, 
miRNA mimics 


Figure 1 | Constraints on evolutionary change in microRNAs. MicroRNAs (miRNAs) lie in a fitness 
valley constrained by their numerous interactions, which include those with the hairpin structure of the 
precursor miRNA (pre-miRNA), the many target mRNAs and other RNAs that terminate or modulate 
miRNA binding to target sequences by competing against them. The latter category includes competing 
endogenous RNAs (ceRNAs), pseudogene decoys and miRNA mimics. Two studies’ introduce circular 
RNAs (circRNAs) as another constraining factor. MRE, miRNA-response element. 


other MREs, but can also suppress translation 
from the coding portion of the ceRNA. Hence, 
compared with circRNAs, ceRNAs operate ina 
more complex weave of interacting molecules 
that constrains translation. Other reservoirs of 
target sites also reside on distinct molecules. 
These include target mimics such as the 
IPS1 gene in the plant Arabidopsis thaliana’, 
decoys within pseudogenes such as PTENP1 
(ref. 7) and possibly 3’-untranslated regions of 
mRNA that are expressed separately from their 
associated protein-coding sequences’. 

Circularizing RNA enhances its stability 
by obviating a role for RNA exonuclease 
enzymes, which act on free 3’ and 5’ ends of 
an RNA molecule to cleave it. Moreover, with 
several binding sites dedicated to antagoniz- 
ing a single miRNA, a circRNA can capture 
miRNAs from numerous targets in one fell 
swoop. Likewise, circRNA destruction could 
release a shower of miRNAs that target mul- 
tiple mRNAs with the shared MRE. In fact, 
Hansen et al. outline a circRNA-destruction 
mechanism in which miR-671 binds CDRlas/ 
ciRS-7 with greater complementarity than 
miR-7 and induces AGO-mediated cleavage 
of this circRNA. 

Snapshot approaches to profiling miRNAs 
reveal that the greatest changes in their expres- 
sion levels occur at transition points in devel- 
opment, cell differentiation or carcinogenesis’. 
Clearance of the mRNA-miRNA duplexes at 
these points and their replacement with differ- 
ent miRNAs could operate through circRNAs. 
For instance, as a brute means of vacuum- 
ing up miRNAs, circRNAs could increase in 
expression as cell differentiation from stem 
cells proceeds, to capture the exceedingly high 
levels of miRNAs expressed in stem cells. They 


could also clean up the opposite strands of 
mature miRNAs, which can be present in sur- 
prisingly large numbers”, or potentially func- 
tion therapeutically to divert cancer-associated 
miRNAs from promoting an oncogenic path- 
way. In all these cases, however, the circRNAs 
sequester miRNAs, and so a knowledge gap 
remains regarding how miRNAsare destroyed. 

To function optimally, the number of 
miRNA-binding sites on each circRNA is 
probably under selection pressure to attract 
nearly all of a specific miRNA population 
from all of its target sites. If so, the number of 
miRNA-binding sites on a circRNA multiplied 
by the number of copies of the circRNA ina 
single cell will inform us about the collective 
strength of all MREs for a particular miRNA. 
Many miRNAs operate at copy numbers of 
10° per cell — a likely lower boundary for the 
number of circRNA sites required to mop 
them all up. However, for circRNAs to win 
out against mRNA targets in the competition 
for miRNA binding, they must have a greater 
affinity for the miRNAs. High affinity can be 
thermodynamically built into the circRNA 
sequence, but may also require an excess of 
circRNA-encoded miRNA-binding sites rela- 
tive to the total number of other relevant MREs 
in the cell. Certainly, modellers will soon be 
romping through this territory. 

The fitness ‘landscape that has contributed 
to maintaining each of the roughly 21-nucle- 
otide miRNAs unchanged over major parts 
of evolution includes circRNAs (Fig. 1). The 
selection pressure on each miRNA nucleotide 
is undoubtedly high: an miRNA sequence 
must base-pair to itself to form the hairpin- 
shaped precursor miRNA; it must pair with a 
host of target mRNAs; and it must pair with 
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binding sites that terminate or modulate target 
interaction. Despite the enormous number 
of possible miRNA sequences, the small 
amount of change in miRNAs implies that 
the remaining evolutionary space for innova- 
tion is limited; in other words, miRNAs have 
approached molecular perfection. Throughout 
animal evolution, nature has tinkered with the 
sequences ofa relatively constant set of coding 
genes, whereas miRNA innovation, in general, 
is more reliant on the invention of completely 
novel sequences’. Perhaps the ease with which 
hairpin-shaped miRNA precursors can arise 
as potential regulatory elements — and fit 
‘digitally into a wealth of genomic non-coding 
sequence, including circRNAs — could serve 
as a driver of evolution. 


As a footnote, a better naming system for 
circRNAs is needed. ‘ciRS-7’ denotes bind- 
ing to miR-7, and therefore assumes that 
other circRNAs in this category will also 
neatly align with a single miRNA. ‘CDRlas’ 
assumes that circRNAs will bear some rela- 
tionship to a named gene — in this case, an 
antisense sequence to the cerebellar degener- 
ation-related gene. With thousands of these 
circRNAs in the genome, they require their 
own numbering system. My suggestion is that 
this one is called circR-1. m 


Kenneth S. Kosik is at the Neuroscience 
Research Institute and in the Department 
of Molecular, Cellular and Developmental 
Biology, University of California, 


The ALMA telescope 
shows its true colours 


Bright, gravitationally magnified galaxies have been found across a wide span 
of cosmic time. The first results from the still- growing ALMA telescope show its 
power to reveal these galaxies’ redshifts and internal structure. SEE LETTER P.344 


ANDREW W. BLAIN 


he Atacama Large Millimeter/submilli- 
"Tone Array (ALMA) interferometer, 

which is nearing completion in Chile’, 
is revolutionizing observational astronomy. It 
provides precise views of the internal work- 
ings of galaxies by imaging the gas and dust 
in their interstellar medium at wavelengths of 
0.3-3 millimetres. On page 344 of this issue, 
Vieira et al.” present some of the first detailed 
images and measurements of redshifts of dis- 
tant galaxies obtained by ALMA (Fig. 1). The 
authors targeted galaxies discovered in a wide- 
field survey using the South Pole Telescope’, to 
find those that are magnified — and so made 
unusually easy to study — by the ‘gravitational 
lensing’ effect of foreground objects*. 

The steady discovery of new classes of 
galaxy, as different windows on the cosmos 
have become accessible, has been a highlight 
of astrophysics for decades. The radio, X-ray 
and far-infrared windows have opened to 
reveal ever more about the processes at work 
in both long-known and previously unappreci- 
ated types of galaxy. By combining the power 
of wide-field surveys to discover galaxies and 
the ability to dissect the galaxies’ nature using 
high-resolution imaging and spectroscopy, an 
improved understanding of their properties 
can be obtained rapidly and efficiently. 


The heritage of this process dates back to the 
1920s with the spectroscopy of diffuse nebu- 
lae obtained using the largest optical telescope 
then available, which led to the measurement 
of the expansion rate of the Universe. Then 
came the recognition of radio sources and 
other active galactic nuclei (AGN) in the 1960s, 
unambiguously showing cosmic evolution 
taking place. Wide-field, space-based surveys 
using the IRAS infrared and ROSAT X-ray sat- 
ellites several decades ago, and more recently 
the far-infrared Herschel Space Observatory, 
mid-infrared WISE and millimetre-wave- 
length Planck spacecraft, have found classes 
of unusual and rare galaxies, and clear clues 
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to the general process of galaxy formation and 
evolution. Ambitious wide-field surveys from 
the ground, ranging from the Sloan Digital 
Sky Survey to the recently started Dark Energy 
Survey, are also driving our view of the proper- 
ties of galaxies, and enabling unprecedentedly 
accurate and complete statistical studies. 

A fresh opportunity to discover unusual 
galaxies, magnified by the gravitational influ- 
ence of foreground objects, has just been 
provided by a combination of the wide-area, 
millimetre-wavelength surveys obtained with 
the South Pole Telescope (SPT) and the imag- 
ing and spectroscopic power of ALMA at the 
galaxies’ pinpointed locations. Vieira and 
colleagues take this opportunity in their study. 
Their results come from an SPT survey of 
1,300 square degrees, an area of sky that blocks 
out about the same angle as that subtended by 
a laptop screen in front of its user. The SPT 
takes images of the sky at three wavebands in 
the millimetre range, with the primary objec- 
tive of identifying distant clusters of galaxies 
by means of the Sunyaev-Zeldovich effect — 
the characteristic spectral signal imprinted in 
these bands on the cosmic microwave back- 
ground relic radiation from the Big Bang, 
due to the scattering of this radiation by hot 
electrons in the clusters’. 
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*This article and the paper under discussion? were . ; 
Figure 1 | The ALMA array in the Atacama Desert of northern Chile. 


published online on 13 March 2013. 
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In parallel, the SPT also detects a bright 
sample of far-infrared-luminous galaxies, 
which are shining because their interstellar 
medium absorbs visible and ultraviolet light 
from stars and AGN. Their measured colours 
in the SPT images are used to weed out gal- 
axies with strong radio emission from AGN, 
and galaxies that are sufficiently near to us to 
have been catalogued using the IRAS satel- 
lite. This leaves an efficiently selected, reliable 
catalogue of distant, bright galaxies, which are 
typically magnified by the gravitational lens- 
ing (or light-bending) effect of foreground 
objects*. The astrophysics of these galaxies 
can be studied in great detail, if a suitable tool 
is available to detect and resolve their internal 
structure’, by taking advantage of the lens- 
ing to increase both their apparent size on 
the sky and their total emitted power. Care- 
ful modelling can provide a view of details 
that would be beyond the reach of even weeks 
of observations without the effect of this 
‘gravitational telescope. 

Vieira et al. showcase the power of ALMA 
by making images and spectra of 47 galaxies 
detected in their SPT survey. They use two 
techniques: imaging at relatively short, 1-mm 
wavelengths to reveal the ‘sheared’ and multiply 
imaged views of the galaxies formed by gravi- 
tational lensing, and spectral scanning across 
the ALMA waveband at around 3 mm, to find 
the galaxies’ redshifts from line emission com- 
ing from their interstellar-medium gas. In a 
large fraction (about 88%) of the 26 spectral- 
scanning observations, secure serendipitous 
redshifts were found from multiple spectral 
lines. The measured redshift range (2-5.7) 
corresponds to a substantial fraction of the his- 
tory of the Universe, with the earliest objects 
dating back to one billion years after the Big 
Bang. Furthermore, the forthcoming ability 
of ALMA to measure the internal motion of 
the gas ina substantial sample of distant galax- 
ies, using spatially resolved spectral measure- 
ments, to reveal their shapes, sizes, rotation 
speeds, masses and any merging subunits, is 
unprecedented. By exploiting lensing, the time 
required to detect and resolve these targets 
using ALMA is reduced to minutes, which is 
ideally suited to exploit the precious time avail- 
able for science as the instrument is brought 
into full service. The lensing effect may subtly 
bias the sample towards higher redshifts, but 
this can be addressed by careful analysis of 
larger samples from the ongoing SPT surveys’. 

There is nowa steady stream of distant, far- 
infrared-luminous, dust-enshrouded galaxies 
that ALMA can target. The fruits of galaxy 
surveys by the Herschel Space Observatory, 
now entering the last months of its multi-year 
mission, are available’, as are those from all- 
sky surveys made using the Planck and WISE 
missions, to catch the most extreme examples 
of the galaxy population in the act of forma- 
tion. New facilities on the ground, including 
the SCUBA-2 camera on the James Clerk 


Maxwell Telescope in Hawaii® and new cam- 
eras becoming available at the IRAM 30-m 
telescope in Spain, are surveying at shorter 
wavelengths than the SPT, bridging the gap 
between the millimetre-wavelength SPT and 
the far-infrared Herschel. These observations 
will yield additional targets, and the Ata- 
cama Cosmology Telescope in Chile’ is also 
conducting a suite of complementary milli- 
metre-wavelength surveys. The sensitivity 
and wide field of view of the planned CCAT 
telescope” will ultimately advance the preci- 
sion and resolution of wide-field far-infrared/ 
submillimetre-band surveys well beyond these 
current survey facilities. 

The promise of ALMA for unravelling the 
findings of wide-field dusty-galaxy surveys, as 
conducted by the SPT, has now been clearly 
demonstrated. It is exciting to see the results, 
but they are a mere prelude to the final capa- 
bilities of this awesome machine — the first 
global-scale international observatory that 
has been constructed by a genuinely equal 
partnership between parties on different 
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continents. With a spatial resolution finer 
than that of the Hubble Space Telescope and 
an ‘always-on’ spectroscopic capability, ALMA 
can probe an otherwise invisible fraction of the 
energy emitted by galaxies, to provide a much 
more complete view of galaxy dynamics and 
evolution, and the processes that form stars 
and fuel AGN. = 
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Detecting selection 


Advances in population genetics and genome sequencing have made it possible to 
identify anonymous fragments of DNA that have undergone selection. This yields 
some evolutionary answers, and a panoply of puzzles. SEE LETTER P.360 


GREGORY S. BARSH & LEIF ANDERSSON 


hen an affenpinscher named 
Banana Joe took the top honours at 
last month’s Westminster Kennel 


Club dog show in New York, the judge referred 
to his “fantastic face [and] great body”. Some 
of Joey's success may be attributed to the awe- 
some power of artificial selection, but we 
sometimes overlook the fact that dogs are 
also our companions in natural selection, and 
have adapted to similar changes in lifestyle and 
nutrient availability over the past 10,000 years 
or so. 

The parallel evolution of humans and our 
animal companions is apparent in a study by 
Axelsson et al.! on page 360 of this issue, in 
which the authors used a population-genomics 
approach to identify regions of the dog genome 
that have undergone selection during domes- 
tication. This method promises to revolution- 
ize evolutionary biology, by challenging us to 
detect traits affected by evolution on the basis 
of genotype rather than an organisms char- 
acteristics, or phenotype. Two other recent 
papers, published in Cell by Grossman et al.’ 
and Kamberov et al.’, rise to this challenge 
and show how hypotheses about an adaptive 
human genotype can be tested in controlled 


experiments. Together, the three papers are 
a wonderful intersection between genomics, 
population science and experimental genet- 
ics — a synergy that has tremendous potential 
for teaching us more about how and why 
organisms evolve. 

At first glance, the approach taken by 
Axelsson and colleagues is simple: to under- 
stand what distinguishes domesticated 
dogs from their wolf ancestors, the authors 
sequenced the genomes of 12 wolves and 
60 dogs and identified DNA fragments that 
show little variation among dogs, but contain 
ahigh density of differences between dogs and 
wolves (a selective signature). They then deter- 
mined which genes lie in those fragments. 

But the devil is in the details of the dog DNA 
— the features that underlie a selective signa- 
ture are exactly the same features that occur 
after a population bottleneck (the reduction in 
genetic diversity that occurs when the size of a 
population is rapidly and drastically reduced). 
To partition DNA fragments selected for dur- 
ing 10,000 years of domestication from those 
caused by population bottlenecks during the 
past several hundred years of breed formation, 
Axelsson et al. created a kind of ‘virtual mutt 
by choosing 60 dogs that represent 14 diverse 
breeds and analysing these data as a single large 
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Figure 1 | A dog’s breakfast. The winner of the prestigious Westminster Kennel Club dog show 
traditionally enjoys a celebratory meal at Sardi’s Restaurant in New York (the 2012 winner, Malachy, 
is shown here). Axelsson and colleagues' show that dogs are especially well suited to a diet of complex 
carbohydrates, as a result of selection pressures exerted during thousands of years of domestication. 


pool. What should emerge from this analysis 
are not genomic regions associated with herd- 
ing, hunting or pointing, but regions that dis- 
tinguish all dogs from their wild ancestors. The 
authors identified several functional categories 
of genes enriched in dog ‘candidate domestica- 
tion regions. They chose to further investigate 
those involved in starch metabolism, reason- 
ing that domestication was, in part, a conse- 
quence of the agricultural revolution and was 
therefore facilitated by adaptation to a starch- 
rich diet (Fig. 1). 

Indeed, Axelsson et al. report evidence 
for gain-of-function alterations in dog genes 
encoding proteins that break down complex 
carbohydrates (AMY2B), hydrolyse oligo- 
saccharides (MGAM) and help to transport 
glucose across the intestinal wall (SGLT1). The 
AMY2B result is particularly notable because it 
seems that the increased activity of the enzyme 
it encodes, amylase, has occurred as a result of 
an increase in the number of copies of AMY2B. 
Gene amplification is also known to underlie’ 
the increased activity of human amylase in 
populations in which ancestors consumed 
diets rich in starch. Thus, the same molecu- 
lar mechanism has acted on similar genes in 
different species exposed to the same dietary 
pressure — a striking example of parallel 
evolution. 

The authors of the Cell papers have taken a 
different tack. They focus on ways to analyse 
the flood of human sequence data that has 
resulted from the next-generation sequencing 
revolution. In cases in which it is possible to 
determine whether adjacent sequence vari- 
ants lie on the same chromosome or homolo- 
gous chromosomes, the range and complexity 
of potential analytical approaches expand*® 
and, together with data on the frequency of 


specific sequence variants, these approaches 
can sometimes be used to pinpoint causative 
variants that underlie recent selective sweeps’. 
(A selective sweep refers to the reduction in 
genetic variation that occurs in regions adja- 
cent to a mutation that confers a strong selec- 
tive advantage.) Grossman et al.’ applied this 
approach to human sequence data from the 
1000 Genomes Project to generate a compre- 
hensive catalogue of several hundred regions of 
around 30 kilobases long, each of which con- 
tains dozens of potential adaptive mutations. 

The thousands of selected variants that 
emerge from such analyses highlight the 
challenge of ‘reverse evolutionary genetics’ 
— determining which phenotypic change 
has been brought about by a specific selected 
region. Even when causal relationships seem 
obvious, caution is warranted. For example, the 
KITLG gene, which encodes a crucial signal- 
ling molecule for the migration of melanocytes 
(skin cells that produce the pigment mela- 
nin), bears a strong signature of selection in 
European populations, but was not detected 
as a skin-colour gene in a recent associa- 
tion study of an African—European admixed 
population®. 

Classical genetic studies are the optimal way 
to establish causal relationships, but in many 
cases these are impossible because the appro- 
priate populations do not exist. Kamberov 
et al.” suggest a way forward using a model 
organism. Their study focused on a well- 
recognized mutation in the human gene 
that encodes the ectodysplasin A receptor 
(EDAR), a signalling molecule that has a role 
in the development of hair, teeth and exocrine 
glands’. The mutation, which occurred around 
30,000 years ago in East Asian populations, 
results in a valine-to-alanine substitution at 
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amino-acid-residue 370 of the protein. It has 
been associated with increased hair thick- 
ness’’, and several lines of evidence suggest 
that it may have other phenotypic effects’. 
Kamberov et al. genetically engineered this 
mutation in mice, and found that the animals 
had thicker hair, altered mammary-gland mor- 
phology and an increased density of eccrine 
sweat glands compared with normal mice. The 
authors then closed the phylogenetic circle by 
showing that the mutation is also associated 
with an increased number of eccrine sweat 
glands in a population of Han Chinese. Thus, 
the phenotypic effect of an adaptive human 
variant that arose around 30,000 years ago 
was explored, confirmed and extended to a 
species whose most recent common ancestor 
with humans was 3,000 times older than this. 

Kamberov and colleagues’ study is an excep- 
tional example of experimental genetics, but 
does it provide, as the authors suggest, a gen- 
eral framework for assessing candidate adap- 
tive mutations? Genetically altered mice are 
a powerful experimental tool, but the extent 
to which recent positive selection in humans 
acts on pathways and amino-acid residues 
that have been conserved across mammalian 
evolution is uncertain. More importantly, it is 
often not clear how to investigate positively 
selected genomic regions for which the target 
gene, let alone its action, is unknown. And 
so a major challenge for population genom- 
ics remains the construction of meaningful 
null hypotheses. As Charles Darwin, the best 
known evolutionary biologist, once said”, 
“It is always advisable to perceive clearly 
our ignorance’. m 
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Functional organization of human 
sensorimotor cortex for speech articulation 


Kristofer E. Bouchard’”, Nima Mesgarani’’, Keith Johnson? & Edward F. Chang!*4 


Speaking is one of the most complex actions that we perform, but nearly all of us learn to do it effortlessly. Production of 
fluent speech requires the precise, coordinated movement of multiple articulators (for example, the lips, jaw, tongue and 
larynx) over rapid time scales. Here we used high-resolution, multi-electrode cortical recordings during the production 
of consonant-vowel syllables to determine the organization of speech sensorimotor cortex in humans. We found speech- 
articulator representations that are arranged somatotopically on ventral pre- and post-central gyri, and that partially 
overlap at individual electrodes. These representations were coordinated temporally as sequences during syllable 
production. Spatial patterns of cortical activity showed an emergent, population-level representation, which was orga- 
nized by phonetic features. Over tens of milliseconds, the spatial patterns transitioned between distinct representations 
for different consonants and vowels. These results reveal the dynamic organization of speech sensorimotor cortex during 
the generation of multi-articulator movements that underlies our ability to speak. 


Speech communication critically depends on the ability to produce 
the large number of sounds that compose a given language’’. The 
wide range of spoken sounds results from highly flexible configura- 
tions of the vocal tract, which filters sound produced at the larynx 
through movements of the lips, jaw and tongue that are coordi- 
nated precisely*°. Each articulator has extensive degrees of freedom, 
making a large number of different speech movements possible. How 
humans exert such precise control despite the wide variety of move- 
ment possibilities is a central unanswered question'®”. 

The cortical control of articulation is mediated primarily by the 
ventral half of the lateral sensorimotor (Rolandic) cortex (ventral 
sensorimotor cortex, vVSMC)*”°, which provides corticobulbar pro- 
jections to, and afferent innervation from, the face and vocal tract 
(Fig. 1a, b)’*’*. The U-shaped vSMC is composed of the pre- and post- 
central gyri (Brodmann areas 1, 2, 3 and 6b), and the gyral area 
directly ventral to the termination of the central sulcus called the 
guenon (Brodmann area 43) (Fig. la, b)’*. Using electrical stimu- 
lation, Foerster and Penfield described the somatotopic organization 
of face and mouth representations in human vSsMC'*"*'*. However, 
focal stimulation could not evoke meaningful utterances, implying 
that speech is not stored in discrete cortical areas. Instead, the pro- 
duction of phonemes and syllables is thought to arise from a coordinated 
motor pattern involving multiple articulator representations'’**>”. 

To understand the functional organization of VSMC in articulatory 
sensorimotor control, we recorded neural activity directly from the 
cortical surface in three human subjects implanted with high-density 
multi-electrode arrays as part of their preparation for epilepsy surgery 
(Fig. 1a). Intracranial cortical recordings were synchronized with micro- 
phone recordings as subjects read aloud consonant-vowel syllables 
(19 consonants followed by /a/, /u/ or /i/; Supplementary Fig. 1) that 
are commonly used in American English. This task was designed to 
sample across a range of phonetic features, including different con- 
striction locations (place of articulation) and different constriction 
degrees or shapes (manner of articulation) for a given articulatory 
organ’, 


vSMC physiology during syllable production 


We aligned cortical recordings to acoustic onsets of consonant-to- 
vowel transitions (ft = 0) to provide a common reference point across 
consonant-vowel syllables (Fig. 1c—e). We focused on the high-gamma 
frequency component of local field potentials (85-175 Hz)”°”?”’, which 
correlates well with multi-unit firing rates**. For each electrode, we 
normalized the time-varying high-gamma amplitude to baseline stat- 
istics by transforming to z-scores. 

During syllable articulation, approximately 30 active VSMC elec- 
trode sites were identified per subject (approximately 1,200 mm’, 
change in z-score of greater than 2 for any syllable). Cortical activity 
from selected electrodes distributed along the vSMC dorsoventral 
axis is shown for /ba/, /da/ and /ga/ (Fig. 1c-e, same colouring as in 
Fig. la). The plosive consonants (/b/, /d/, /g/) are produced by tran- 
sient occlusion of the vocal tract by the lips, front tongue and back 
tongue, respectively, whereas the vowel /a/ is produced by a low, back 
tongue position during phonation. Dorsally located electrodes (for 
example, Fig. 1c-e, electrodes 124 and 108; black) were active during 
production of /b/, which requires transient closure of the lips. In 
contrast, mid-positioned electrodes (for example, electrodes 129, 133 
and105; grey) were active during production of /d/, which requires 
forward tongue protrusion against the alveolar ridge. A more ventral 
electrode (for example, electrode 104; red) was most active during 
production of /g/, which requires a posterior-oriented tongue elevation 
towards the soft palate. Other electrodes appear to be active during the 
vowel phase for /a/ (for example, electrodes 154, 136 and 119). 

Cortical activity at different electrode subsets was superimposed to 
visualize spatiotemporal patterns across other phonetic contrasts. 
Consonants produced with different constriction locations of the ton- 
gue tip, (for example, /0/ (dental), /s/ (alveolar), and /|/ (post-alveolar)), 
showed specificity across different electrodes in central VSMC (Fig. 1f), 
although they were not as categorical as those shown for consonants 
involving different articulators in Fig. 1c-e. Consonants with similar 
tongue constriction locations, but different constriction degree or con- 
striction shape, were generated by overlapping electrode sets exhibiting 
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Figure 1 | vSMC physiology during syllable production. a, Magnetic 
resonance imaging (MRI) reconstruction of a single subject brain with 

vSMC electrodes (dots), coloured according to distance from the Sylvian fissure 
(black and red are the most dorsal and ventral positions, respectively). 

b, Expanded view of vSMC anatomy. cs, central sulcus; PoCG, post-central 
gyrus; PrCG, pre-central gyrus; Sf, Sylvian fissure. Scale bars, 1 cm. c-e, 

Top, vocal tract schematics for three consonants (/b/, /d/, /g/), produced by 
occlusion at the lips, tongue tip and tongue body, respectively (red arrow). 
Middle, spectrograms of spoken consonant-vowel syllables. Bottom, average 
cortical activity from a subset of electrodes (electrode number on far right, same 
colouring as in a). Vertical dashed line, acoustic onset of consonant-vowel 
transition. f-h, Cortical activity at selected electrodes for different phonetic 
contrasts (mean + s.e.m.). Acoustic waveforms are displayed above. 

f, Fricatives (/0/(‘th’ of ‘thin’), /s/, /{/ (‘sh’ of ‘shin’)) with different constriction 
locations. g, Front tongue consonants (/I/, /n/, /d/) with different constriction 
degree or shapes. h, Single consonant (/j/ (‘y’ of ‘yes’)) with different vowels 
(/a/, /i/, /u/). Purple arrows correspond to a tongue electrode with prolonged 
activity for /i/ and /u/ vowels. Black arrow corresponds to an active lip 
electrode for /u/. 
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different relative activity magnitudes (Fig. 1g, /I/ (lateral) versus /n/ 
(nasal stop) versus /d/ (oral stop)). Syllables with the same consonant 
followed by different vowels (Fig. 1h, /ja/, /ji/, /ju/) were found to have 
similar activity patterns before the consonant-vowel transition. During 
vowel phonation, a dorsal electrode is clearly active during /u/, but not / 
i/ or /a/ (Fig. 1h, /ju/; black arrow) whereas another electrode in the 
middle of vSMC had prolonged activity during /i/ and /u/ vowels 
compared to /a/ (Fig. 1h, /ji/ and /ju/; purple arrows). These contrast- 
ing examples show that important phonetic properties can be observed 
qualitatively from the rich repertoire of VSMC spatiotemporal patterns. 


Spatial representation of articulators 

To determine the spatial organization of speech-articulator represen- 
tations, we examined how cortical activity at each electrode depended 
on the movement of a given articulator (using a general linear model). 
Weassigned binary variables to four articulatory organs (lips, tongue, 
larynx and jaw) that are used in producing the consonant component 
of each consonant-vowel syllable (Supplementary Fig. 1). The spatial 
distribution of optimal weightings for these articulators (averaged 
over time and subjects) were plotted as a function of dorsoventral 
distance from the Sylvian fissure and anteroposterior distance from 
the central sulcus. We found representations for each articulator dis- 
tributed across vSMC (Fig. 2a). For example, the lip representation 
was localized to the dorsal aspect of vVSMC, whereas the tongue 
representation was distributed more broadly than the lip representa- 
tion across the ventral aspect. 

To determine topographic organization of articulators across sub- 
jects, we extracted the greatest 10% of weightings from individual 
articulator distributions (Fig. 2a) and used a clustering algorithm (k- 
nearest neighbour) to classify the surrounding cortex (Fig. 2b). We 
found an overall somatotopic dorsoventral arrangement of articulator 
representations laid out in the following sequence: larynx, lips, jaw, 
tongue and larynx (Fig. 2a, b and Supplementary Figs 2-5). An analysis 
of the fractional representation of all articulators at single electrodes 
showed a clear tuning preference for individual articulators at single 
electrodes and also demonstrated that single electrodes had functional 
representations of multiple articulators (Supplementary Fig. 6). 


Timing of articulator representations 

As the time course of articulator movements is on the scale of tens of 
milliseconds, previous approaches have been unable to resolve tem- 
poral properties associated with individual articulator representations. 
We examined the timing of correlations between cortical activity 
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Figure 2 | Spatial representation of articulators. a, Localization of lips, jaw, 
tongue and larynx representations. Average magnitude of articulator 
weightings (colour scale) plotted as a function of anteroposterior (AP) distance 
from the central sulcus and dorsoventral (DV) distance from the Sylvian fissure 
(n = 3 subjects). b, Functional somatotopic organization of speech-articulator 
representations in vSMC. Lips (L, red); jaw (J, green); tongue (T, blue); larynx 
(X, black); mixed (yellow). Letters correspond to locations, based on direct 
measurement-derived regression weights; shaded rectangles correspond to 
regions classified by k-nearest neighbour. 
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and specific consonant articulators (using partial correlation analysis), 
and included two vowel articulatory features (back tongue and high 
tongue; Supplementary Fig. 1). 

Time courses of correlations were plotted for electrodes with high- 
est values, sorted by onset latency (Fig. 3a). We found that jaw, high 
tongue and back tongue had very consistent timing across electrodes. 
Similar results were found for tongue, lips and larynx, but with more 
variable latencies. Timing relationships between articulator represen- 
tations were staggered, reflecting a temporal organization during 
syllable production: lip and tongue correlations began well before 
sound onset (Fig. 3a, c, d); jaw and larynx correlations were aligned 
to the consonant-vowel transition (Fig. 3a, c, d); and high tongue and 
back tongue features showed high temporal specificity for the vowel 
phase, peaking near the acoustic mid-point of the vowels (approxi- 
mately 250 ms, Fig. 3b-d). This sequence of articulator correlations 
was consistent across subjects (Fig. 3d, P< 10 '°, analysis of variance 
(ANOVA), F = 40, df. = 5, n = 211 electrodes from 3 subjects) and 
is in accordance with the timing of articulator movements shown in 
speech-kinematics studies**'”**. We found no statistically significant 
onset-latency differences in those areas 10 mm anterior and posterior 
to the central sulcus) or across the guenon (P > 0.4, rank-sum test; 
n = 71 and n = 67, respectively; Supplementary Fig. 7). This is con- 
sistent with mixed sensory and motor orofacial responses throughout 
vSMC, which are also seen in stimulation experiments’*”’. 


Phonetic organization of spatial patterns 


The distributed organization of speech articulator representations 
(Fig. 2) led us to propose that coordination of the multiple articulators 
required for speech production would be associated with spatial 
patterns of cortical activity. We refer here to this population-derived 
pattern as the phonetic representation. To determine its organiza- 
tional properties, we used principal component analysis to transform 
the observed cortical activity patterns into a ‘cortical state-space’ 
(approximately 60% of variance is explained by 9 spatial princi- 
pal components for all subjects, Supplementary Figs 8 and 9)?*°°. 
k-means clustering during the consonant phase (25 ms before the 
consonant-vowel transition, t= —25ms) showed that the cortical 
state-space was organized into three clusters (quantified by silhouette 
analysis) corresponding to the major oral articulators: labial, coronal 
tongue, and dorsal tongue (Fig. 4a and Supplementary Fig. 10). 
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Figure 3 | Temporal representation of articulators. a, b, Timing of 
correlations between cortical activity and consonant (a) and vowel 

(b) articulator features. Colour maps display correlation coefficients (R) for a 
subset of electrodes. c, Acoustic landmarks. Onset (end of arrows, left), peak 
power (shown by a dot in each case) and offset (end of arrows, right) for 
consonant-vowel syllables (mean + s.e.m.,n = 168 syllables, all subjects). Error 
bars are smaller than the symbols. d, Temporal sequence and range of 
correlations. Symbols are as in c. Data are mean (symbols) + s.e.m. (thick solid 
line) across electrodes from all subjects. 
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During the vowel phase (250 ms after the consonant-vowel transition, 
t = 250), we found clear separation of /a/, /i/ and /u/ vowel states 
(Fig. 4b). Similar clustering of consonants and vowels was found 
across subjects (P<10 '° for clustering of both consonants and 
vowels, Supplementary Fig. 11). 

Theories of speech motor control and phonology have speculated 
that there is a hierarchical organization of phoneme representations, 
given the anatomical and functional dependencies of the vocal tract 
articulators during speech production**’”"**', To evaluate such 
organization in VSMC, we applied hierarchical clustering to the cor- 
tical state-space (Fig. 4c, d). For consonants, this analysis confirmed 
that the primary tier of organization was defined by the major oral 
articulator features: dorsal, labial or coronal (Fig. 4c). These major 
articulators were superordinate to the constriction location within 
each articulator. For example, the labial cluster could be subdivided 
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b State-space at t = 250 ms 
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Figure 4 | Phonetic organization of spatial patterns. a, b, Scatterplots of 
consonant-vowel syllables in the first three principal components for 
consonants (25 ms before consonant-vowel transition) (a) and vowels (250 ms 
after consonant-vowel transition) (b). A subset of consonant-vowels are 
labelled with international phonetic alphabet (IPA) symbols, all others have 
dots. Colouring denotes k-means cluster membership. c, d, Hierarchical 
clustering of cortical state-space at consonant (25 ms before consonant-vowel 
transition) and vowel time points (250 ms after consonant-vowel transition). 
Individual syllables and dendrogram branches are colour-coded and labelled by 
known linguistic categories using the same colour scheme as in Fig 4 a, b, with 
new subdivisions of the coronal tongue into front tongue and sibilant (in green 
and red, respectively). Lat, lateral; N. stop, nasal stop; O. stop, oral stop. 

e, f, Correlations between cortical state-space and phonetic features. Black 
vertical lines, medians; grey boxes, 25th and 75th percentiles. ***P < 107°, 
WSRT; n = 297 for both consonants and vowels. NS, not significant. 
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into bi-labial and labiodental. Only at the lowest level of the hierarchy 
did we observe suggestions of organization according to constriction 
degree or shape, such as the sorting of nasal (/n/ syllables), oral stops 
(/d/, /t/) and lateral approximants (/I/). Similarly, during the vowel 
period, a primary distinction was based on the presence or absence of 
lip rounding (/u/ versus /a/ and /i/), and a secondary distinction was 
based on tongue posture (height, and front or back position) (Fig. 4d). 
Therefore, the major oral articulator features that organize consonant 
representations are similar to those for vowels. 

Across an early time period (375 ms before, to 120ms after, 
the consonant-vowel transition), we found that consonant features 
describing constriction location had a significantly greater correla- 
tion with the cortical state-space than constriction degree, which in 
turn was significantly more correlated than the upcoming vowel 
(P<10 1°, Wilcoxon signed-rank test (WSRT), n = 297 from 3 sub- 
jects; see Supplementary Fig. 12 for phonetic feature sets). This 
analysis shows that constriction location accounts for more of the 
structure of spatial activity patterns than does constriction degree 
or shape. Similarly, across a later time period (125 ms to 620 ms 
after the consonant-vowel transition), we found that vowel features 
provided the greatest correlation (vowel configuration versus other 
feature sets, P< 10° 1°, WSRT, n = 297 from 3 subjects). 


Dynamics of phonetic representations 


The dynamics of neural populations have provided insights into the 
structure and function of many neural circuits®*°’’’?****. To deter- 
mine the dynamics of phonetic representations, we investigated 
how state-space trajectories for consonants and vowels entered and 
departed target regions for phonetic clusters. Trajectories of indi- 
vidual consonant-vowel syllables were visualized by plotting their 
locations in the first two principal-component dimensions versus 
time (Fig. 5a, b; principal component 1 (PC1) and PC2 for one of 
the subjects). 

We examined first how trajectories of different consonants transi- 
tioned to a single vowel, /u/ (Fig. 5a). The cortical state-space was 
initially unstructured, and then individual trajectories converged 
within phonetic clusters (for example, labial, front tongue, dorsal 
tongue and sibilant), and at the same time trajectories for different 
clusters diverged from one another. These convergent and divergent 
dynamics gradually increased the separability of different phonetic 
clusters (the mean difference of between-cluster and within-cluster 
distances). Later, as each consonant transitioned to /u/, trajectories 


converged to a compact target region for the vowel. Finally, tra- 
jectories diverged randomly, presumably as articulators returned 
to neutral position. Analogous dynamics were observed during the 
production of a single consonant cluster (for example, labials) tran- 
sitioning to different vowels (/a/, /i/ and /u/) (Fig. 5b). 

We quantified the internal dynamical properties of the cortical 
state-space by calculating cluster separability. The time course of clus- 
ter separability, averaged across subjects and consonant-vowel sylla- 
bles (Fig. 5c) showed that separability peaked approximately 200 ms 
before the consonant-vowel transition for consonants (onset, approxi- 
mately 300 ms before the consonant-vowel transition), and at 250 ms 
after the consonant-vowel transition for vowels (onset, approximately 
50 ms after the consonant-vowel transition). We examined further the 
dynamics of correlations between the structure of the cortical state- 
space and phonetic features (averaged across subjects) (plotted in 
Fig. 5d). Across subjects, we found that cluster separability and the 
correlation between cortical state-space organization and phonetic 
features were tightly linked for both consonants and vowels in a 
time-dependent fashion (R* range = 0.42-0.98, P< 10 '° inall cases). 
This shows that the dynamics of clustering in the cortical state-space 
is coupled strongly to the degree to which the cortical state reflects the 
phonetic structure of the vocalization. 

Visualization of the dynamic structure of the cortical state-space 
during production of all consonant-vowel syllables (Fig. 5e) showed 
that, as the cortical state comes to reflect phonetic structure, different 
phonetic clusters diverge from one another, while the trajectories 
within the clusters converge. Furthermore, we observed correlates 
of the earlier articulatory specification for sibilants (1, /z/, /s/). In 
addition, with all consonant-vowel syllables on the same axes, we 
observed that in comparison to vowels, consonants occupy a distinct 
region of cortical state-space, despite sharing the same articulators. 
The distribution of state-space distances was significantly greater 
in consonant-vowel comparisons than in consonant-consonant or 
vowel-vowel comparisons (P< 10 '° for all comparisons, WSRT, 
n = 4623 in all cases, Supplementary Fig. 11). Finally, the conson- 
ant-to-vowel sequence reveals a periodic structure, which is sub- 
specified for consonant and vowel features. 


Discussion 

Our broad-coverage, high-resolution direct cortical recordings enabled 
us to examine the spatial and temporal profiles of speech articu- 
lator representations in human vSMC. Cortical representations are 
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Figure 5 | Dynamics of phonetic representations. a, b, Cortical state-space 
trajectories. a, Consonants transitioning to the vowel /u/. Each line corresponds 
to a single consonant-vowel trajectory. For each line, the left triangle indicates 
t= —500 ms, the square indicates t = —25 ms, the circle indicates t = 250 ms, 
and the right triangle indicates t = 750 ms. b, Trajectories of the labial 
consonants transitioning to /a/, /i/ and /u/. c, d, Across-subject averages of 
cluster separability (c) and correlation between cortical state-space structure 
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and phonetic features (d) for consonants and vowels (mean + s.e.m.). e, Time- 
course of consonant-vowel syllable trajectories for one subject. Each colour 
corresponds to one of the consonant or vowel groups (colours are the same as in 
aandb above). The centre of each coloured tube is located at the centroid of the 
corresponding phonetic cluster. Tube diameter corresponds to cluster density 
and colour saturation represents the correlation between the structure of the 
cortical state-space and phonetic features. 
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somatotopically organized, with individual sites tuned for a preferred 
articulator and co-modulated by other articulators. The dorsoventral 
layout of articulator representations recapitulates the rostral-to-caudal 
layout of the vocal tract. However, we found an additional laryngeal 
representation located at the dorsal-most end of vVSMC*?°***°, This 
dorsal laryngeal representation seems to be absent in non-human 
primates''’*°’, suggesting a unique feature of human vSMC for the 
specialized control of speech. Pre- and post-central gyrus neural acti- 
vity occurred before vocalization, which may reflect the integration of 
motor commands with proprioceptive information for rapid feedback 
control during speaking’**. 

Just as focal stimulation is insufficient to evoke speech sounds, it is 
not any single articulator representation, but the coordination of mul- 
tiple articulator representations across the VSMC network that gener- 
ates speech. Analysis of spatial patterns of activity showed an emergent 
hierarchy of network states that organizes phonemes by articulatory 
features. This functional hierarchy of network states contrasts with the 
anatomical hierarchy often considered in motor control. The cortical 
state-space organization probably reflects the coordinative patterns 
of articulatory motions during speech, and is notably similar to a 
theorized cross-linguistic hierarchy of phonetic features (‘feature 
geometry’)*'**'*. In particular, the findings support gestural theories 
of speech control’ over alternative acoustic (a hierarchy organized 
primarily by constriction degree)'” or vocal-tract geometry theories 
(no hierarchy of constriction location and degree)"*. 

The vSMC population showed convergent and divergent dynamics 
during the production of different phonetic features. The dynamics 
of individual phonemes were superimposed on a slower oscillation 
that characterizes the transition between consonants and vowels. 
Although trajectories were found to originate or terminate in different 
regions, they consistently pass through the same (target) region of the 
state-space for shared phonetic features*®. Consonants and vowels 
occupy distinct regions of the cortical state-space. Large state-space 
distances between consonant and vowel representations may explain 
why it is more common in speech errors to substitute consonants with 
one another, and vowels with vowels, but very rarely consonants with 
vowels or vowels with consonants (that is, in ‘slips of the tongue’)*”. 

We have shown that a relatively small set of articulator representa- 
tions can combine flexibly to create the large variety of speech sounds 
in American English. The major organizational features found here 
define phonologies of languages from across the world*’. Consequently, 
these cortical organizational principles are likely to be conserved, 
with further specification for unique articulatory properties across 
different languages. 


METHODS SUMMARY 


Three subjects underwent surgical placement of subdural arrays as part of their 
clinical treatment for epilepsy (see Supplementary Table 1 for clinical details). 
Statistical tests were considered significant if the Bonferroni corrected rate of 
incorrectly rejecting the null hypothesis was less than 0.05. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


The experimental protocol was approved by the Human Research Protection 
Program at the University of California, San Francisco. 

Subjects and experimental task. Three native-English-speaking human subjects 
underwent chronic implantation of a high-density, subdural electrocortigraphic 
(ECoG) array over the left hemisphere (two subjects) or right hemisphere (one 
subject) as part of their clinical treatment of epilepsy (see Supplementary Table 1 
for clinical details)**. Subjects gave their written informed consent before the day 
of surgery. All subjects had self-reported normal hearing and underwent neuro- 
psychological language testing (including the Boston naming and verbal fluency 
tests) and scored within the range considered normal. Each subject read aloud 
consonant-vowel syllables composed of 18 or 19 consonants (19 consonants for 
two subjects, 18 consonants for one subject), followed by one of three vowels. 
Each consonant-vowel syllable was produced between 15 and 100 times. Micro- 
phone recordings were synchronized with the multi-channel ECoG data. 

Data acquisition and signal processing. Cortical local field potentials (LFPs) 
were recorded with ECoG arrays and a multi-channel amplifier connected 
optically to a digital signal processor (Tucker-Davis Technologies). The spoken 
syllables were recorded with a microphone, amplified digitally, and recorded 
simultaneously with the ECoG data. ECoG signals were acquired at 3,052 Hz. 

The time series from each channel was inspected visually and quantitatively for 

artefacts or excessive noise (typically 60 Hz line noise). These channels were 
excluded from all subsequent analysis and the raw recorded ECoG signal of the 
remaining channels were then common-average referenced and used for spectro- 
temporal analysis. For each (useable) channel, the time-varying analytic ampli- 
tude was extracted from eight bandpass filters (Gaussian filters, logarithmically 
increasing centre frequencies (85-175 Hz) and semi-logarithmically increasing 
bandwidths) with the Hilbert transform. The high-gamma (high-y) power was 
then calculated by averaging the analytic amplitude across these eight bands, and 
then this signal was down-sampled to 200 Hz. High-y power was z-scored relative 
to the mean and standard deviation of baseline data for each channel. Throughout 
the Methods, high-y power refers to this z-scored measure. 
Acoustic analysis. The recorded speech signal was transcribed off-line using 
WaveSurfer (http://www.speech.kth.se/wavesurfer/). The onset of the consonant- 
to-vowel transition was used as the common temporal reference point for all 
subsequent analysis (see Supplementary Fig. 1). This was chosen because it permits 
alignment across all of the syllables and allows for a consistent discrimination of 
the consonantal and vocalic components. Post-hoc analysis of acoustic timing 
revealed the onset of the consonant-to-vowel transition to be highly reprodu- 
cible across multiple renditions of the same syllable. As such, alignment at the 
consonant-to-vowel transition results in relatively small amounts of inter-syllable 
jitter in estimated times of acoustic onset, offset and peak power. 

For temporal analysis of the consonant-vowel acoustic structure, each indi- 
vidual vocalization was first converted to a cochlear spectrogram by passing the 
sound-pressure waveform through a filter bank emulating the cochlear transfer 
function”. As the current analysis of cortical data leverages the cross-syllabic 
variability in (average) high-y (see below), we reduced the data set of produced 
vocalizations to a single exemplar for each consonant-vowel syllable. For each 
unique consonant-vowel syllable, the cochlear spectrograms associated with each 
utterance of that consonant-vowel (S;(¢,f)) were analysed to find a single proto- 
typical example (Pspct), defined as the syllable that had the minimum (min) 
spectrotemporal difference from every other syllable of that kind: 


Pspct= min s, (Zz(5.)-s0.9)") (1) 


where, S;(t,f) is the spectrogram of the ith example of the syllable, corresponding 
to the power at time t and frequency f. The onset, peak and offset of acoustic 
power were extracted for each syllable prototype using a thresholding procedure. 
Articulator state matrix and phonetic feature matrix. To describe the engage- 
ment of the articulators in the production of different consonant-vowel syllables, 
we drew from standard descriptions of the individual consonant and vowel sounds 
in the International Phonetic Alphabet (IPA)”°. Each consonant-vowel syllable was 
associated with a binary vector describing the engagement of the speech articu- 
lators used to produce the consonant-vowel syllable. For the linear analyses pre- 
sented in Figs 2 and 3, the articulator state vector (B;) for each consonant-vowel 
syllable s; was defined by six binary variables describing the four main articulator 
organs (lips, tongue, larynx and jaw) for consonant production and two vocalic 
tongue configurations (high tongue and back tongue) (Supplementary Fig. 1). 
Although more detailed descriptions are possible (for example, alveolar-dental), 
the linear methods used for these analyses require the articulator variables to be 
linearly independent (no feature can be described as an exact linear combination of 
the others), although the features may have correlations. An expanded phonetic 
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feature matrix (nine consonant constriction location variables, six consonant con- 
striction degree or shape variables, and four vowel tongue and lip configuration 
variables; derived from the IPA, Supplementary Fig. 12) was used in the non- 
parametric analysis of the cortical state-space (Figs 4 and 5). 

Spatial organization derived from a general linear model. To examine the 
spatial organization with which high-y was modulated by the engagement of 
the articulators, we determined how the activity of each electrode varied with 
consonant articulator variables using a general linear model (GLM). Here, at each 
moment in time (t), the GLM described the high-y of each electrode as an 
optimally weighted sum of the articulators engaged during speech production. 
High-y(t) (Hy(#)) recorded on each electrode (e;), during the production of 
syllable s;, Hy;,(t), was modelled as a linear weighted sum of the binary vector 
associated with the consonant component of s;, (BS): 


Hy; ;(t) = f(t)eBS + Bio(t) (2) 


The coefficient vector f(t) that resulted in the least-mean square difference 
between the levels of activity predicted by this model and the observed Hy(t) 
across all syllables was found by linear regression. For each electrode e; at time t, 
the associated 1 X 4 slope vector (f;,(t)) quantifies the degree to which the engage- 
ment of a given articulator modulated the cross-syllable variability in Hy(f) at 
that electrode. Coefficients of determination (R’) were calculated from the 
residuals of this regression. In the current context, R’ can be interpreted as the 
amount of cross-syllabic variability in Hy that can be explained by the optimally 
weighted linear combination of articulatory state variables. 

The spatial organization of the speech articulators was examined using the 
assigned weight vectors (f;(t)) from the GLM described above. First, the fit of 
the GLM at each electrode e; was determined to be of interest if, on average, the 
associated P-value was less than 0.05 for any one of the four consonant articulator 
time windows (T'4; see below) determined from the partial-correlation analysis 
below. We defined this time window to be the average onset-to-offset time of 
statistically significant (P < 0.05) partial correlations for each individual articu- 
lator in each subject (see the section on partial correlation analysis below). This 
method identifies electrodes whose activity is predicted well by the GLM for 
any of the individual articulators, as well as for combinations of these articulators, 
for extended periods of time. As these time windows extend for many points, 
this is a relatively stringent criterion in comparison to a min-finding method or 
looking for single significant-crossings. In practice, the minimum P-values 
(across time) associated with the vast majority of these electrodes are several 
orders of magnitude less then 0.05. For the electrodes that were gauged to have 
statistically significant correlations in each subject, we averaged the weights for 
each articulator (A) in that articulators time window (T',). Thus, each electrode 
of interest (e;) is assigned four values, with each value corresponding to the 
weighting for that articulator, averaged across that articulator’s time window: 


Wh = Bid (3) 
A teT, 

For the analysis of representational overlap at individual electrodes (Sup- 
plementary Fig. 6), each electrode was classified according to the dominant 
articulator weight in a winner-take-all manner. The fractional articulator weight- 
ing was calculated based on the positive weights at each electrode, and was plotted 
as the average percentage of summed positive weights. 

For spatial analysis, the data for each subject were smoothed using a 2-mm 
uniform circular kernel. The maps presented and analysed in Supplementary 
Figs 2 and 3 correspond to these average weights for the lips, tongue, larynx 
and jaw. The maps presented and analysed in Fig. 2 correspond to these average 
weights for each articulator averaged across subjects. The spatial organization 
of vSMC is described by plotting the results of the GLM for an individual on the 
cortex of that individual. We used a Cartesian plane defined by the antero- 
posterior distance from the central sulcus (ordinate) and the dorsoventral dis- 
tance from the Sylvian fissure (azimuth). This provides a consistent reference 
frame to describe the spatial organization of each subject’s cortex and to combine 
data across subjects while preserving the individual differences. 

Somatotopic map and k-nearest neighbours algorithm. To construct the sum- 
mary somatotopic map of Fig. 2b, we first extracted the spatial location of the top 
10% of weights for each articulator (averaged across subjects, data are shown in 
Fig. 2a). We then used a k-nearest neighbour algorithm to classify the surround- 
ing cortical tissue based on the nearest k = 4 neighbours within a spatial extent of 
3mm of each spatial location; if no data points were present within 3 mm, the 
location is unclassified. Locations in which no clear majority (>50%) of the 
nearest neighbours belonged to a single articulator were classified as mixed. 
These values were chosen to convey, in summary form, the visual impression 
of the individual articulator maps, and to ‘fill in’ spatial gaps in our recordings. 


©2013 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


The summary map changed smoothly, and as expected with changes in the 
threshold of weights for each articulator of individual articulator maps, k (num- 
ber of neighbours), spatial extent and minimum number of points. Results are 
qualitatively insensitive to the details of this analysis, including the choice of 10% 
as a threshold, as changes in the clustering algorithm could be made to accom- 
modate subtle differences in data inclusion (for visual comparison, we display the 
somatotopic maps derived from the same algorithm derived from the top 5%, top 
10% and top 15% of weights in Supplementary Fig. 4). 

Partial correlation analysis. To quantify the temporal structure with which 
single-electrode Hy was correlated with the engagement of a single articulator, 
we used partial correlation analysis. Partial correlation analysis is a standard 
statistical tool that quantifies the degree of association between two random 
variables (here, Hy(t) and the engagement of a given articulator, A;), and removes 
the effect of a set of other random variables (here, the other articulators, A;, j ¥ i). 
For a given electrode, the partial correlation coefficient between Hy(t) across 
syllables at time ¢ and articulator A; (p(Hy(t),A;)) is calculated as the correlation 
coefficient between the residuals r(Hy(t),A;), j “i resulting from de-correlating 
the Hy(t) and every other articulator A;, j #i, and the residuals 1(A;,Aj), iA j 
resulting from de-correlating the articulators from one another: 


cov(r(Hy(T), Aj). r(Aj, Aj)) 
01 X02 


p(Hy(t), Aj) AFj (4) 
Where o; and o; are the standard deviations of r(Hy(t),A;) and r(A;,A)), res- 
pectively. In the current context, the partial correlation coefficients quantify 
the degree to which the cross-syllabic variability in Hy at a given moment in time 
was uniquely associated with the engagement of a given articulator during speech 
production. For each articulator, we analysed those electrodes whose peak partial 
correlation coefficient (p) exceeded the mean + 2.5 o of p values across electro- 
des and time (>mean(p(e;,t)) + 2.50(p(e;f))). In the text, we focus on the positive 
correlations (which we denote as R in the text), because there were typically a 
larger number of positive values than negative values (mean p > 0), and in general 
the temporal profiles are similar for negative values and for expositional simpli- 
city. Results did not qualitatively change with changes in this threshold of 
approximately + 0.2 o. We extracted the onset, offset and peak times for each 
articulator for each electrode that crossed this threshold. The data presented in 
Fig. 3d are the mean + s.e.m. of these timing variables across electrodes pooled 
across subjects. The average onset and offset for each of the four consonant 
articulators (lips, tongue, jaw and larynx) in each subject was used to define the 
articulator time window used in the spatial analysis described above. 
Principal component analysis and cortical-state space. Principal components 
analysis (PCA) was carried out on the set of all VSMC electrodes for dimension- 
ality reduction and orthogonalization. PCA was performed on the n X m*t 
covariance matrix Z with rows corresponding to channels (of which there 
are n) and columns corresponding to concatenated Hy(t) (length t) for each 
consonant-vowel (of which there are m). Each electrode’s time series was z-scored 
across syllables to normalize response variability across electrodes. The singular- 
value decomposition of Z was used to find the eigenvector matrix M and asso- 
ciated eigenvalues 4. The principal components (PCs) derived in this way serve 
as a spatial filter of the electrodes, with each electrode e; receiving a weighting in 
PC; equal to Mip where M is the matrix of eigenvectors. Across subjects, we 
observed that the eigenvalues (1) exhibited a fast decay with a sharp inflection 
point at the ninth eigenvalue, followed by a much slower decay thereafter 
(Supplementary Fig. 8). We therefore used the first nine eigenvectors (PCs) as 
the cortical state-space for each subject. 

The cortical state-space representation of syllable s, at time ¢, K(s;,,t), is defined 
as the projection of the vector of cortical activity associated with s, at time t, 
Hy;,(t), onto M: 


K (Sit) =MeH),(t) (5) 


We calculated the contribution of articulators to the cortical state-space (PCw3) 
by projecting each electrode’s weight vector (f; derived from the GLM model 
above) into the first three dimensions of the cortical state-space (i = 1-3): 


PCy, =MyeB; (6) 


Here, PCw; is a four-element vector of the projected articulator weights for 
electrode e; into PC;. In Supplementary Fig. 9, we plot logio of the absolute value 
of PCwy across electrodes, which describes the distribution of magnitudes of the 
representations associated with the four articulators in a given PC. 

Clustering analysis. The k-means and hierarchical clustering analyses were 
carried out on the cortical state-space representations of syllables, K(s;,t), based 
on the pair-wise Euclidean distances calculated between consonant-vowel 
syllable representations. Agglomerative hierarchical clustering used Ward’s 
method. All analyses of the detailed binary phonetic feature matrix were carried 
out using both Hamming and Euclidean distances; results did not change between 
metrics qualitatively or statistically. We used silhouette analysis to validate the 
claim that there were three clusters at the consonant time. The silhouette of a 
cluster is a measure of how close (on average) the members of that cluster are to 
each other, relative to the next nearest cluster. For a particular data set, the average 
silhouette for a given number of clusters describes the parsimony of the num- 
ber of clusters in the data. Hence, examining the silhouette across different 
numbers of clusters gives a quantitative way to determine the most parsimonious 
number of clusters*'. Higher values correspond to more parsimonious clustering. 
On average across subjects, this analysis validated the claim that three clusters 
(average silhouette = 0.47) was a more parsimonious clustering scheme than 
either two (average silhouette = 0.45) or four clusters (average silhouette = 0.43). 
Correlation of cortical state-space structure with phonetic structure. At each 
moment in time, we wanted to quantify the similarity of the structure of cortical 
state-space representations of phonemes and the structure predicted by different 
phonetic feature sets. To do this, we measured the linear correlation coefficient 
between vectors of unique pair-wise Euclidean distances between phonemes cal- 
culated in the cortical state-space (DC(t)) and in the phonetic feature matrix (DP): 


= cov(DC(t), DP) 


Opc(t) * ODP 


R(t) (7) 


As described above, the phonetic feature matrix was composed of three distinct 
phonetic feature sets (consonant constriction location, consonant constriction 
degree or shape, vowel configuration). Distances were calculated indepen- 
dently in these three subsets and correlated with DC(t). Standard error measures 
of the correlation coefficients were calculated using a bootstrap procedure (1,000 
iterations). 

Cluster separability. Cluster separability is defined at any moment in time as the 
difference between the average of cross-cluster distances and the average of 
within-cluster distances. This quantifies the average difference of the distance 
between syllables in different clusters and the tightness of a given cluster. We 
quantified the variability in cluster separability estimation using a 1,000-iteration 
bootstrap procedure of the syllables used to calculate the metric. 

Cluster density. We quantified the average cluster density by calculating the 
average inverse of all unique pair-wise distances between consonant-vowels in 
a given cortical state-space cluster. It is a density because the number of elements 
in a cluster does not change with time. 


48. Mesgarani, N.& Chang, E. F. Selective cortical representation of attended speaker 
in multi-talker speech perception. Nature 485, 233-236 (2012). 

49. Yang, X. et al. Auditory representations of acoustic signals. /EEE Transactions Inf. 
Theor. 38, 824-839 (1992). 

50. International Phonetic Association. Handbook of the International Phonetic 
Association (Cambridge Univ. Press, 1999). 

51. Rousseeuw, P. Silhouettes: a graphical aid to the interpretation and validation of 
cluster analysis. J. Comput Appl. Math. 20, 53-65 (1987). 


©2013 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


doi:10.1038/nature11928 


Circular RNAs are a large class of animal 
RNAs with regulatory potency 


Sebastian Memczak!*, Marvin Jens!*, Antigoni Elefsinioti!*, Francesca Torti', Janna Krueger’, Agnieszka Rybak', Luisa Maier’, 
Sebastian D. Mackowiak!, Lea H. Gregersen’, Mathias Munschauer?, Alexander Loewer*, Ulrike Ziebold!, Markus Landthaler’, 
Christine Kocks', Ferdinand le Noble* & Nikolaus Rajewsky' 


Circular RNAs (circRNAs) in animals are an enigmatic class of RNA with unknown function. To explore circRNAs 
systematically, we sequenced and computationally analysed human, mouse and nematode RNA. We detected 
thousands of well-expressed, stable circRNAs, often showing tissue/developmental-stage-specific expression. 
Sequence analysis indicated important regulatory functions for cireRNAs. We found that a human circRNA, antisense 
to the cerebellar degeneration-related protein 1 transcript (CDRlas), is densely bound by microRNA (miRNA) effector 
complexes and harbours 63 conserved binding sites for the ancient miRNA miR-7. Further analyses indicated that 
CDRlas functions to bind miR-7 in neuronal tissues. Human CDRlas expression in zebrafish impaired midbrain 
development, similar to knocking down miR-7, suggesting that CDRlas is a miRNA antagonist with a miRNA-binding 
capacity ten times higher than any other known transcript. Together, our data provide evidence that circRNAs form a 
large class of post-transcriptional regulators. Numerous circRNAs form by head-to-tail splicing of exons, suggesting 


previously unrecognized regulatory potential of coding sequences. 


Mature messenger RNAs are linear molecules with 5’ and 3’ termini 
that reflect start and stop of the RNA polymerase on the DNA tem- 
plate. In cells, different RNA molecules are sometimes joined together 
by splicing reactions (trans-splicing), but covalent linkage of the ends 
of a single RNA molecule to form a circular RNA (circRNA) is usually 
considered a rare event. circRNAs were discovered in plants and shown 
to encode subviral agents’. In unicellular organisms, circRNAs mostly 
stem from self-splicing introns of pre-ribosomal RNA’, but can also 
arise from protein-coding genes in archaea’. In the few unambiguously 
validated circRNAs in animals, the spliceosome seems to link the 5’ 
and downstream 3’ ends of exons within the same transcript*”®. 
Perhaps the best known circRNA is antisense to the mRNA transcribed 
from the SRY (sex-determining region Y) locus and is highly expressed 
in testes®. Evidence from computational analyses of expression data in 
Archaea and Mammalia suggests that circRNAs are more prevalent 
than previously thought*”®; however, it is unknown whether animal 
circRNAs have any biological function. 

In comparison to circRNAs, miRNAs are extremely well studied. 
miRNAs are ~21-nucleotide-long non-coding RNAs that guide the 
effector protein Argonaute (AGO) to mRNAs of coding genes to 
repress their protein production’’"*. In humans, miRNAs directly 
regulate expression of most mRNAs'*"* in a diverse range of bio- 
logical functions. However, surprisingly little is known about how 
and if mRNAs can escape regulation by a miRNA. A recently discov- 
ered mechanism for miRNA removal in a sequence-specific manner is 
based on target sites acting as decoys or miRNA sponges’””’. RNA with 
miRNA binding sites should, if expressed highly enough, sequester 
away the miRNA from its target sites. However, all reported mam- 
malian miRNA sponges have only one or two binding sites for the same 
miRNA and are not highly expressed, limiting their potency*!™. 

To identify circRNAs across animal cells systematically, we screened 
RNA-seq data for circRNAs. Compared to previous approaches” our 


computational pipeline can find circRNAs in any genomic region, 
takes advantage of long (~100 nucleotides) reads, and predicts the 
acceptor and donor splice sites used to link the ends of the RNAs. 
We do not rely on paired-end sequencing data or known splice sites. 
Using published’®***’® and our own sequencing data, our method 
reported thousands of circRNAs in human and mouse tissues as well 
as in different developmental stages of Caenorhabditis elegans. 
Numerous circRNAs appear to be specifically expressed across tissues 
or developmental stages. We validated these data and showed that 
most tested circRNAs are well expressed, stable and circularized 
using the predicted splice sites. circRNA sequences were significantly 
enriched in conserved nucleotides, indicating that circRNAs compete 
with other RNAs for binding by RNA binding proteins (RBPs) or 
miRNAs. We combined biochemical, functional and computational 
analyses to show that indeed a known human circRNA, CDR1 anti- 
sense (CDRlas)’, can function as a negative regulator of miR-7, a 
miRNA with perfect sequence conservation from annelids to human. 
Together, our data provide evidence that circRNAs form an important 
class of post-transcriptional regulators. 


circRNAs have complex expression patterns 


To comprehensively identify stably expressed circRNAs in animals we 
screened RNA sequencing reads for splice junctions formed by an 
acceptor splice site at the 5’ end of an exon and a donor site at a 
downstream 3’ end (head-to-tail) (Fig. la). As standard RNA expres- 
sion profiling enriches for polyadenylated RNAs, we used data gene- 
rated after ribosomal RNA depletion (ribominus) and random 
priming. Such data were used before to detect scrambled exons in 
mammals’? (see Methods for comparison). However, this approach 
was not specifically designed to detect circRNAs and (1) only used 
existing exon-intron annotations, thus missing RNAs transcribed 
from introns or unannotated transcripts; (2) did not explicitly identify 
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Figure 1 | Detection, classification and evolutionary conservation of 
circRNAs. a, The termini of junction-spanning reads (anchors) align 
sequentially to the genome for linear (top) but in reversed orientation for head- 
to-tail spliced reads (bottom). Spliced reads must distribute completely to 
anchors, flanked by AG/GU (Methods). b, ¢, circRNAs in human cell types 
(b) and nematode stages (c). d, Genomic origin of human circRNAs. A total of 
96% of circRNAs overlap known transcripts. e, Examples of human circRNAs. 
The AFFI intron is spliced out (Supplementary Fig. 2e). Sequence conservation: 
placental mammals phyloP score (Methods), scale bar, 200 nucleotides. 

f, A total of 223 human coding sequence circRNAs with mouse orthologues 
(green) and controls (black) with matched conservation level (inset: mean 
conservation for each codon position (grey), controls (black); x axis, codon 
positions; y axis, placental mammals phyloP score; see also Methods and 
Supplementary Fig. 1j, k). Third codon positions are significantly more 
conserved (P< 4 X 10 2°, Mann-Whitney U-test, n = 223). 


the splice sites used for circularization; and (3) assumed that each pair 
of mates in paired-end sequencing derives from the same RNA mole- 
cule. To search in a more unbiased way for circRNAs, we designed 
an algorithm (Methods) that identifies linear and circular splicing 
events in ribominus data. First, we filtered out reads that aligned con- 
tiguously to the genome, retaining the spliced reads. Next, we mapped 
the terminal parts of each candidate read independently to the genome 
to find unique anchor positions. Finally, we demanded that (1) anchor 
alignments can be extended such that the original read sequence 
aligns completely, and (2) the inferred breakpoint is flanked by GU/ 
AG splice signals. Non-unique mappings and ambiguous breakpoints 
were discarded. We detected circularization splicing from the reversed 
(head-to-tail) orientation of the anchor alignments (Fig. la). Our 
method also recovered tens of thousands of known linear splicing 
events (Methods and Supplementary Fig. la, b). We estimated sen- 
sitivity (>75%) and false-discovery rate (FDR <0.2%) using simulated 
reads and various permutations of real sequencing data (Methods and 
Supplementary Fig. 1c). However, the efficiency of ribominus pro- 
tocols to extract and sequence circRNAs is limited, reducing overall 
sensitivity. 

We generated ribominus data for HEK293 cells and, combined 
with human leukocyte data’®, detected 1,950 circRNAs with support 
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from at least two independent junction-spanning reads (Fig. 1b). The 
expression of genes predicted to give rise to circRNAs was only slightly 
shifted towards higher expression values (Supplementary Fig. 1d), 
indicating that circRNAs are not just rare mistakes of the spliceo- 
some. We also identified 1,903 circRNAs in mouse (brains, fetal 
head, differentiation-induced embryonic stem cells; Supplementary 
Fig. le)*°?°; 81 of these mapped to human circRNAs (Supplemen- 
tary Fig. 1f). To explore whether circRNAs exist in other animal clades, 
we used sequencing data that we produced from various C. elegans 
developmental stages (Stoeckius, M. et al., manuscript in preparation) 
(Methods) and detected 724 circRNAs, with at least two independent 
reads (Fig. Ic). 

Numerous circRNAs seem to be specifically expressed in a cell type 
or developmental stage (Fig. 1b, c and Supplementary Fig. le). For 
example, hsa-circRNA 2149 is supported by 13 unique, head-to-tail 
spanning reads in CD19” leukocytes but is not detected in CD34* 
leukocytes (which were sequenced at comparable depth; Supplemen- 
tary Table 1), neutrophils or HEK293 cells. Analogously, a number of 
nematode circRNAs seem to be expressed in oocytes but absent in 
1- or 2-cell embryos. 

We annotated human circRNAs using the RefSeq database and a 
catalogue of non-coding RNAs’. 85% of human circRNAs align 
sense to known genes. Their splice sites typically span one to five 
exons (Supplementary Fig. 1g) and overlap coding exons (84%), but 
only in 65% of these cases are both splice sites that participate in the 
circularization known splice sites (Supplementary Table 2), demon- 
strating the advantage of our strategy. 10% of all circRNAs align 
antisense to known transcripts, smaller fractions align to UTRs, 
introns, unannotated regions of the genome (Fig. 1d). Examples of 
human circRNAs are shown in Fig. le. 

We analysed sequence conservation within circRNAs. As genomic 
sequence is subject to different degrees of evolutionary selection, 
depending on function, we studied three subtypes of circRNAs. 
Intergenic and a few intronic circRNAs display a mild but significant 
enrichment of conserved nucleotides (Supplementary Fig. 1h, i). 
To analyse circRNAs composed of coding sequence and thus high 
overall conservation, we selected 223 human circRNAs with circular 
orthologues in mouse (Methods) and entirely composed of coding 
sequence. Control (linear) exons were randomly selected to match the 
level of conservation observed in first and second codon positions 
(Methods, Fig. 1f inset and Supplementary Fig. 1k for conservation 
of the remaining coding sequence (CDS)). circRNAs with conserved 
circularization were significantly more conserved in the third codon 
position than controls, indicating evolutionary constraints at the nuc- 
leotide level, in addition to selection at the protein level (Fig. 1f and 
Supplementary Fig. 1j, k). In summary, we have confidently identified 
a large number of circRNAs with complex expression patterns, which 
derive often but not always from coding exons. Sequence conservation 
suggests that at least a subset contains functional sequence elements. 


Characterization of 50 predicted circRNAs 


We experimentally tested our circRNA predictions in HEK293 cells. 
Head-to-tail splicing was assayed by quantitative polymerase chain 
reaction (qPCR) after reverse transcription, with divergent primers 
and Sanger sequencing (Fig. 2a, b). Predicted head-to-tail junctions 
of 19 out of 23 randomly chosen circRNAs (83%) could be validated, 
demonstrating high accuracy of our predictions (Table 1). In contrast, 
5 out of 7 (71%) candidates exclusively predicted in leukocytes could not 
be detected in HEK293 cells, validating cell-type-specific expression. 
Head-to-tail splicing could be produced by trans-splicing or geno- 
mic rearrangements. To rule out these possibilities as well as potential 
PCR artefacts, we successfully validated the insensitivity of human 
circRNA candidates to digestion with RNase R—an exonuclease that 
degrades linear RNA molecules*°—by northern blotting with probes 
which span the head-to-tail junctions (Fig. 2c). We quantified RNase 
R resistance for 21 candidates with confirmed head-to-tail splicing by 
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Figure 2 | CircRNAs are stable transcripts with robust expression. 

a, Human (hsa) ZRANB1 circRNA exemplifies the validation strategy. 
Convergent (divergent) primers detect total (circular) RNAs. Sanger 
sequencing confirms head-to-tail splicing. b, Divergent primers amplify 
circRNAs in cDNA but not genomic DNA (gDNA). GAPDH, linear control, 
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significantly enriched compared to coding sequences (P< 2.96 X 10”, 
Mann-Whitney U-test, n = 3,873) or 3’ UTR sequences (P< 2.76 
x 1077}, Mann-Whitney U-test, n = 3,182) (Supplementary Fig. 3a, b). 

As an extreme case, we discovered that the known human circRNA 
CDRilas (ref. 9) harboured dozens of conserved miR-7 seed matches. 
To test whether CDRlas is bound by miRNAs, we analysed bio- 
chemical, transcriptome-wide binding-site data for the miRNA 
effector AGO proteins. We performed four independent PAR-CLIP 
(photoactivatable-ribonucleoside-enhanced crosslinking and immu- 
noprecipitation) experiments for human AGO (Methods) and ana- 
lysed them together with published, lower-depth data**. PAR-CLIP**** 
is based on ultraviolet crosslinking of RNA to protein and subsequent 
sequencing of RNA bound to a RBP of interest. The ~ 1.5-kilobase (kb) 
CDRlas locus stood out in density and number of AGO PAR-CLIP 
reads (Fig. 3a), whereas nine combined PAR-CLIP libraries for other 
RBPs gave virtually no signal. Of note, there is no PAR-CLIP read 
mapping to the sense coding transcript of the CDR1 gene, which was 
originally identified as a target of autoantibodies from patients with 
paraneoplastic cerebellar degeneration”>. 

Sequence analysis across 32 vertebrate species revealed that miR-7 
is the only animal miRNA with conserved seed matches that can 
explain the AGO binding along the CDRl1as transcript (Methods). 
Human CDRlas harbours 74 miR-7 seed matches of which 63 are 


AGO PAR-CLIP reads 


size marker in base pairs. c, Northern blots of mock (—) and RNase R (+) 
treated HEK293 total RNA with head-to-tail specific probes for circRNAs. 
GAPDH, linear control. d, e, circRNAs are at least 10-fold more RNase R 
resistant than GAPDH mRNA (d) and stable after 24h transcription block 
(e) (qPCR; error bars indicate standard deviation). 


qPCR. All of these were at least 10-fold more resistant than GAPDH 
(Fig. 2d and Supplementary Fig. 2a). We reasoned that circRNAs 
should generally turn over more slowly than mRNAs. Indeed, we 
found that 24h after blocking transcription circRNAs were highly 
stable, exceeding the stability of the housekeeping gene GAPDH" 
(Fig. 2e and Supplementary Fig. 2b). We also validated 3 out of 3 
tested mouse circRNAs with human orthologues in mouse brains 
(Supplementary Fig. 2c). In C. elegans 15 out of 20 (75%) of the pre- 
dictions from gametes and early embryos were validated in a mixed 
stage sample (Supplementary Fig. 2d and Supplementary Table 3). 


circRNA CDRlas is densely bound by AGO 


Stable transcripts with many miRNA-binding sites could function as 
miRNA sponges. We intersected our catalogue of circRNAs with 
transcript annotations, assuming that introns would not occur in 
mature circRNAs (as observed for 3 out of 3 tested circRNAs, 
Supplementary Fig. 2e). We screened for occurrences of conserved 
miRNA family seed matches (Methods). When counting repetitions 
of conserved matches to the same miRNA family, circRNAs were 


Table 1 | Summary of the validation experiments 


Sample Validation experiment Validation success 

Human (HEK293) Head-to-tail splicing 19 of 23 
Circularity 21 of 21 
Expression >3% vinculin 12 of 21 
Expression specificity 5 of 7 
(leukocyte specific) 

Mouse (adult brain) Head-to-tail splicing 3 of 3 
Circularity 3 of 3 
Expression >1% B-actin 2 of 3 

C. elegans Head-to-tail splicing 15 of 20 
Circularity 13 of 13 
Expression >1% eif-3.d 12 of 15 


Most experimentally tested circRNAs are validated. 
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Figure 3 | The circRNA CDRlas is bound by the miRNA effector protein 
AGO, and is cytoplasmic. a, CDR1as is densely bound by AGO (red) but not 
by unrelated proteins (black). Blue boxes indicate miR-7 seed matches. nt, 
nucleotides. b, c, miR-7 sites display reduced nucleotide variability across 32 
vertebrate genomes (b) and high base-pairing probability within seed matches 
(c). d, CDRlas RNA is cytoplasmic and disperse (white spots; single-molecule 
RNA FISH; maximum intensity merges of Z-stacks). sisCR, positive; siRNA1, 
negative control. Blue, nuclei (DAPI); scale bar, 5 j1m (see also Supplementary 
Fig. 10 for uncropped images). e, Northern blotting detects circular but not 
linear CDRlas in HEK293 RNA. Total, HEK293 RNA; circular, head-to-tail 
probe; circ+lin, probe within splice sites; IVT lin., in vitro transcribed, linear 
CDRlas RNA. f, Circular CDR1as is highly expressed (qPCR, error bars 
indicate standard deviation). g, CDR1as. Blue, seed matches; dark red, AGO 
PAR-CLIP reads; bright red, crosslinked nucleotide conversions. 
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conserved in at least one other species (Supplementary Fig. 4). 
Interspaced sequences were less conserved, indicating that miR-7 
binding sites are probably functional (Fig. 3b). Secondary structure 
analysis of predicted circRNA-miRNA duplexes (Methods) showed 
reduced base-pairing of miR-7 beyond the seed (Fig. 3c). None of the 
~1,500 miR-7 complementary sites across 32 vertebrate sequences 
was complementary beyond position 12 of miR-7 (only three could 
form an 11-nucleotide duplex) (Supplementary Table 4). Slicing by 
mammalian Argonaute requires complementarity of positions 10 and 
11 and depends on extended complementarity beyond position 12 
(ref. 36). Thus, CDRlas seems optimized to be densely bound but not 
sliced by miR-7. 

Single-molecule imaging (Methods) revealed disperse and mostly 
cytoplasmic CDRlas expression (HEK293 cells), consistent with 
miRNA sponge function (Fig. 3d and Supplementary Table 5). 
CDRlas circularization was assayed by northern blotting (Fig. 3e). 
Nicking experiments confirmed that CDRlas circRNA can be linea- 
rized and degraded (Supplementary Fig. 5a). In RNA from HEK293 
cells, circularized but no additional linear CDRlas was detected 
(Supplementary Fig. 5b). Circular expression levels were quantified 
by qPCR with divergent primers calibrated by standard curves 
(Supplementary Table 6). CDRlas was highly expressed (~15% to 
~20% of GAPDH expression, Fig. 3f). Estimating GAPDH mRNA 
copy number from HEK293 RNA-seq data (~1,400 molecules per 
cell, data not shown) suggests that CDRlas may bind up to ~20,000 
miR-7 molecules per cell (Fig. 3g). 

If CDRlas functions as a miR-7 sponge, its destruction could trigger 
downregulation of miR-7 targets. We knocked down CDRlas in 
HEK293 cells and monitored expression of published miR-7 targets 
by qPCR with externally spiked-in standards (Methods and Supplemen- 
tary Fig. 5c, d). All eight miR-7 targets assayed, but also housekeeping 
genes, were downregulated. Nanostring technology” additionally indi- 
cated downregulation of many genes (data not shown). Furthermore, 
stable loss of CDRlas expression by virally delivered small hairpin 
RNAs led to significantly reduced migration in an in vitro wound clo- 
sure assay (Methods, Supplementary Fig. 5e, f and Supplementary 
Table 7). Thus, knockdown of CDRlas affects HEK293 cells, but we 
could not delineate miR-7-specific effects, potentially because of indirect 
or miR-7-independent CDR1as function (see below). 


Co-expression of miR-7 and CDRIas in brain 


If CDR1las indeed interacts with miR-7, both must be co-expressed. 
miR-7 is highly expressed in neuronal tissues, pancreas and pituitary 
gland**. Apart from HEK293 cells, a cell line probably derived from 
neuronal precursors in embryonic kidney’, we quantified miR-7 and 
CDRlas expression across mouse tissues and pancreatic-island- 
derived MIN6 cells (Methods and Fig. 4a). CDRlas and miR-7 were 
both highly expressed in brain tissues, but CDRlas was expressed at 
low levels or absent in non-neuronal tissues, including tissues with 
very high miR-7 expression. qPCR suggested that CDRlas is exclu- 
sively circular in adult and embryonic mouse brain (Supplementary 
Fig. 5g, h). Thus, CDRlas and miR-7 seem to interact specifically 
in neuronal tissues. Indeed, when assaying CDRlas and miR-7 in 
mouse brains by in situ hybridizations (Methods), we observed spe- 
cific, similar, but not identical, expression patterns in the brain of 
mid-gestation (embryonic day 13.5 (E13.5)) embryos (Fig. 4b). Speci- 
fically, CDRlas and miR-7 were highly co-expressed in areas of the 
developing midbrain (mesencephalon)***'. Thus, CDR1as is highly 
expressed, stable, cytoplasmic, not detectable as a linear RNA and 
shares expression domains with miR-7. Together with extensive 
miR-7 binding within CDRlas, CDR1as has hallmarks of a potent 
circular miR-7 sponge in neuronal tissues. 


Effects of miR-7 and CDR1as in zebrafish 


It would be informative to knock out CDRlas in an animal model 
system. However, a knockout would also affect CDR1 protein, with 
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Figure 4 | CDRlas and miR-7 have overlapping and specific expression in 
neuronal tissues. a, Among mouse tissues and MIN6 cells (qPCR, relative to 
cerebral cortex expression; error bars indicate standard deviations; see 
Supplementary Fig. 9a for miR-122 control) neuronal tissues co-express miR-7 
and CDRias. b, In situ staining of CDRlas and miR-7 in mouse embryo brain 
E13.5 (U6 and miR-124, positive control; scrambled probe, negative control). 
Scale bar, 1 mm. 


unknown consequences. This problem is circumvented when using zeb- 
rafish (Danio rerio) as an animal model. According to our bioinformatic 
analyses (not shown) zebrafish has lost the cdr1 locus, whereas miR-7 is 
conserved and highly expressed in the embryonic brain*. Thus, we can 
test whether miR-7 has a loss-of-function phenotype and if this pheno- 
type can be induced by introduction of mammalian CDRlas RNA. We 
injected morpholinos to knock down mature miR-7 expression in zebra- 
fish embryos (Methods). At a dose of 9ng of miR-7 morpholino, the 
embryos did not show overall morphological defects but reproducibly, 
and in two independent genetic backgrounds (Supplementary Fig. 6a—c), 
developed brain defects (Fig. 5a, b). In particular, ~70% showed a con- 
sistent and clear reduction in midbrain size, and an additional ~5% of 
animals had almost completely lost their midbrains. Of note, the tel- 
encephalon at the anterior tip of the brain was not affected in size. Brain 
volumes were also measured based on confocal three-dimensional stacks 
(Fig. 5c and Supplementary Fig. 7). Reduction of the midbrain size 
correlated with miR-7 inhibition in the respective animals (Supplemen- 
tary Fig. 6d). These data provide evidence that miR-7 loss-of-function 
causes a specific reduction of midbrain size. 

To test whether CDRlas can function as a miR-7 sponge in vivo, we 
injected embryos with plasmid DNA that expressed a linear version of 
the full-length human CDR1as sequence (Supplementary Fig. 6e, f) or 
a plasmid provided by the Kjems laboratory that can produce circular 
CDRlas in human cells (Fig. 5d, e). qPCR analysis detected circular 
RNA in zebrafish embryos injected with the latter plasmid (Sup- 
plementary Fig. 8), which reproducibly and in independent genetic 
backgrounds lead to reduced midbrain sizes (Fig. 5g, h). Similarly, 
animals injected with in vitro-transcribed partial mouse CDRlas 
RNA, but not with RNA from the other strand, showed significant 
midbrain reduction (Supplementary Fig. 6g-i). Thus, the phenotype 
is probably caused by CDRlas RNA and not by an unspecific effect of 
RNA or DNA injection. These results provide evidence that human/ 
mouse CDRlas transcripts are biologically active in vivo and impair 
brain development similarly to miR-7 inhibition. The midbrain 
reduction could be partially rescued by injecting miR-7 precursor 
(Fig. 5f, g), arguing that the biological effect of CDRlas expression 
is caused at least in part by interaction of CDRlas with miR-7. 


Discussion 


We have shown that animal genomes express thousands of circRNAs 
from diverse genomic locations (for example, from coding and non- 
coding exons, intergenic regions or transcripts antisense to 5’ and 
3' UTRs) in a complex tissue-, cell-type- or developmental-stage- 
specific manner. We provided evidence that CDRlas can act as a 
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Figure 5 | In zebrafish, knockdown of miR-7 or expression of CDRlas 
causes midbrain defects. a, b, Neuronal reporter (Tg(huC:egfp)) embryos (top, 
light microscopy) 48h post fertilization (bottom, representative confocal 
z-stack projections; blue dashed line, telencephalon (TC) (control); yellow 
dashed line, midbrain (MB)). Embryos after injection of 9 ng miR-7 
morpholino (MO) (b) display a reduction in midbrain size. Panel a shows a 
representative embryo injected with 15 ng control morpholino. c, Three- 
dimensional volumetric reconstructions. d, Empty vector control. e, Expression 
vector encoding human circular CDRlas. f, Rescue experiment with miR-7 
precursor. g, Phenotype penetrance (% of embryos, miR-7 MO, n = 135; 
uninjected, n = 83; empty vector, n = 91; linear CDRlas, n = 258; circular 
CDRlas, n = 153; circular CDR1as plus miR-7 precursor, n = 217). Phenotype 
distribution derived from at least three independent experiments. Scale bar, 
0.1 mm. **P < 0.01; ***P < 0.001 in Students t-test for normal midbrain, 
reduced midbrain (see also Supplementary Fig. 6). h, Phenotype quantification 
(Methods). Error bars indicate standard deviation n = 3 per group. 


post-transcriptional regulator by binding miR-7 in brain tissues: (1) 
CDRIas is densely bound by miRNA effector molecules; (2) CDRlas 
harbours 74 miR-7 seed matches, often deeply conserved; (3) CDRlas 
is expressed highly, stably and mostly cytoplasmic; (4) CDRlas and 
miR-7 share specific expression domains in mouse embryonic brain; 
(5) human/mouse CDRIas is circularized in vivo and is not detectable 
as a linear molecule; (6) human/mouse CDRlas sequences, when 
injected into zebrafish, and miR-7 knock down have similar pheno- 
types in brain. While zebrafish circularization of human CDRlas may 
be incomplete, the midbrain phenotype was stronger compared to 
expressing linear CDRlas RNA that lacks circularization splice sites. 
Although the two DNA plasmids used carry identical promoters and 
were injected in equal concentrations, we cannot rule out the possibility 
that the difference in midbrain phenotype strength may be explained 
by other factors. However, because of the observed extreme stability 
of CDRlas and circRNAs in general, our data argue that circRNAs 
can be used as potent inhibitors of miRNAs or RBPs. Future studies 
should elucidate how CDRlas can be converted into a linear mole- 
cule and targeted for degradation. miR-671 can trigger destruction of 
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CDRlas’. Thus, CDRlas may function to transport miR-7 to subcel- 
lular locations, where miR-671 could trigger release of its cargo. Known 
functions of miR-7 targets such as PAK1 and FAK] support these 
speculations. 

The phenotype induced by CDRlas expression in zebrafish was 
only partially rescued by expressing miR-7, indicating that CDRlas 
could have functions beyond sequestering miR-7. This idea is sup- 
ported by in situ hybridization in mouse adult hippocampus (Sup- 
plementary Fig. 9b) where areas staining for CDRlas but not miR-7 
were observed. What could be additional functions of circRNAs 
beyond acting as sponges? As a single-stranded RNA, CDRIlas could, 
for example, bind in trans 3' UTRs of target mRNAs to regulate their 
expression. It is even possible that miR-7 binds CDRlas to silence 
these trans-acting activities. Alternatively, CDRlas could be involved 
in the assembly of larger complexes of RNA or protein, perhaps 
similar to other low-complexity molecules”. 

How many other circRNAs exist? In this study, we identified appro- 
ximately 2,000 human, 1,900 mouse and 700 nematode circRNAs 
from sequencing data, and our validation experiments confirmed 
most of the 50 tested circRNAs. However, we analysed only a few 
tissues/developmental stages with stringent cutoffs. Thus, the true 
number of circRNAs is almost certainly much larger. Although 
CDRlas is an extreme case, many circRNAs have conserved seed 
matches. For example, circRNA from the SRY locus’ has seed sites 
for murine miRNAs. Therefore, circRNAs probably compete with 
other RNAs for miRNA binding. Sequence analyses indicated that 
coding exons serve additional, presumably regulatory functions when 
expressed within circRNAs, whereas intergenic or intronic circRNAs 
generally showed only weak conservation. Because we detected thou- 
sands of circRNAs, it is appealing to speculate that occasional circu- 
larization of exons is easy to evolve and may provide a mechanism 
for rapid evolution of stably and well expressed regulatory RNAs. Of 
note, we detected multiple seed matches for viral miRNAs within 
human circRNAs (not shown). However, there is no reason to think 
that circRNAs function predominantly to bind miRNAs. As known 
in bacteria, the decoy mechanism underlying miRNA sponges could 
be important also for RBPs***’. Similarly, circRNAs could function 
to store, sort, or localize RBPs. In summary, our data suggest that 
circRNAs form a class of post-transcriptional regulators which com- 
pete with other RNAs for binding by miRNAs and RBPs and may 
generally function in modulating the local free concentration of RBPs, 
RNAs, or their binding sites. 

Note added in proof: While this paper was under review, circular 
RNAs in fibroblasts were described*'. 


METHODS SUMMARY 

Computational pipeline for predicting circRNAs from ribominus sequencing 
data. A detailed description of the computational methods is given in the 
Methods. 

Cell culture and treatments. HEK293, HEK293TN and HEK293 Flp-In 293 
T-REx (Life Technologies) were cultured following standard protocols. Tran- 
scription was blocked by adding 21g ml~' actinomycin D (Sigma). RNase R 
(Epicentre Biotechnologies) treatment (3 U ug — 1) was performed on total RNA 
(5 jg) at 37 °C for 15 min. qPCR primers are listed in Supplementary Table 8. 
Single-molecule RNA fluorescence in situ hybridization (smRNA FISH). 
Stellaris Oligonucleotide probes complementary to CDR1as were designed using 
the Stellaris Probe Designer (Biosearch Technologies). Probe pools were obtained 
from BioCat GmbH as conjugates coupled to Quasar 670. Probes were hybridized 
at 125 nM at 37°C. Images were acquired on an inverted Nikon Ti microscope. 
Mouse strains and in situ hybridization. In situ hybridization (ISH) was per- 
formed on paraffin tissue sections from B6129SF1/J wild-type mice as described** 
using locked nucleic acid (LNA) probes or RNAs obtained by in vitro transcrip- 
tion on PCR products. 

Zebrafish methods. Tg(huC:egfp) and Tg(Xia. Tubb:dsRED) transgenic zebrafish 
lines were used*”*°. Morpholino antisense oligomers were injected into the yolk of 
single-cell-stage embryos. Furthermore, two pCS2+ plasmids coding for full- 
length linear CDRlas or CDRIas plus upstream and downstream sequence that 
can express circular CDR1as in human cells (courtesy of the Kjems laboratory) 
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were injected. Confocal imaging was performed using Carl Zeiss MicroImaging. 
Reduced midbrain development was defined as >50% smaller than the mean size 
of controls. Each experimental group was evaluated in at least three independent 
experiments; a minimum of 80 individual embryos per group was examined. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Computational pipeline for predicting circRNAs from ribominus sequencing 
data. Reference genomes (human hg19 (February 2009, GRCh37), mouse mm9 
(July 2007, NB137/mm49), C. elegans ce6 (May 2008, WormBase v. WS190)) were 
downloaded from the UCSC genome browser (http://genome.ucsc.edu/)”’”. In a 
first step, reads that aligned contiguously and full length to the genomes were 
discarded. From the remaining reads we extracted 20mers from both ends and 
aligned them independently to find unique anchor positions within spliced exons. 
Anchors that aligned in the reversed orientation (head-to-tail) indicated circRNA 
splicing (compare main Fig. 1a). We extended the anchor alignments such that 
the complete read aligns and the breakpoints were flanked by GU/AG splice sites. 
Ambiguous breakpoints were discarded. We used the short-read mapper Bowtie 
2 (ref. 52). Initially, ribominus reads were aligned in end-to-end mode to the 
genome: 


$ bowtie2 -p16 --very-sensitive --phred64 --mm -M20 --score-min=C, -15, 0 -q-x 
<index> -U reads.qfa 2> bowtie2.log | samtools view -hbuS - | samtools sort - 
sample_vs_genome 


The unmapped reads were separated and run through a custom script to split the 
reads as indicated in Fig. 1a to obtain 20-nucleotide anchors from both ends of the 
read: 


$ samtools view -hf 4 sample_vs_genome.bam | samtools view -Sb - > unmapped_ 
sample.bam 


$ ./unmapped2anchors.py unmapped_sample.bam | gzip > sample_anchors.qfa. 
gz 


Here is an example of two anchor pairs in the FASTQ format; the original 
read was kept as part of the first anchors identifier to simplify downstream 
analysis: 


@s_8_1_0001_qseq_14_A_ | NCCCGCCTCACCGGGTCAGTGAAAAAACGA 
TCAGAGTAGTGGTCTTCTTCCGGCGGCCCCGCGCGCGCCGCGCTGC 


NCCCGCCTCACCGGGTCAGT 
+4 

#BBG@?@ABG; = GB; BEG58 ( 
@s_8_1_0001_qseq_14_B 


CCCCGCGCGCGCCGCGCTGC 
+ 
Fag (>) 50; . BHHREETHH 


Next the anchors were aligned individually to the reference, keeping their paired 
ordering. The resulting alignments were read by another custom script that 
jointly evaluates consecutive anchor alignments belonging to the same original 
read, performs extensions of the anchor alignments, and collects statistics on 
splice sites. After the run completes, the script outputs all detected splice junctions 
(linear and circular) in a UCSC BED-like format with extra columns holding 
quality statistics, read counts etc. The original full-length reads that support each 
junction are written to stderr: 


$ bowtie2 -p16 -reorder --mm -M20 --score-min=C, -15, 0 -q -x genome -U 
sample_anchors.qfa.gz | ./find_circ.py -S hg19 -p sample_ -s sample/sites.log > 


sample/sites.bed 2> sample/sites.reads 


The resulting BED-like file is readily filtered for minimal quality cutoffs to pro- 
duce the reported circRNA candidates. In particular, we demanded the following: 
(1) GU/AG flanking the splice sites (built in); (2) unambiguous breakpoint detec- 
tion; (3) a maximum of two mismatches in the extension procedure; (4) the 
breakpoint cannot reside more than 2 nucleotides inside an anchor; (5) at least 
two independent reads (each distinct sequence only counted once per sample) 
support the junction; (6) unique anchor alignments with a safety margin to the 
next-best alignment of at least one anchor above 35 points (~more than two extra 
mismatches in high-quality bases); and (7) a genomic distance between the two 
splice sites of no more than 100 kb (only a small percentage of the data). As the 
ribosomal DNA cluster is part of the C. elegans genome assembly (ce6) and ribo- 
somal pre-RNAs could give rise to circular RNAs by mechanisms independent 
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of the spliceosome, we discarded 130 candidates that mapped to the rDNA cluster 
on chrI:15,060,286- 15,071,020. 

Permutation testing. To test the robustness of the circRNA detection pipeline we 
altered the sequence of real sequencing reads in different ways at the step of 
anchor generation. We (1) reversed either anchor; (2) reversed the complete read; 
(3) randomly reassigned anchors between reads; or (4) reverse complemented the 
read (as a positive control). Although the reverse complement recovered the same 
output as expected, the various permutations led to only very few candidate 
predictions, well below 0.2% of the output with unpermuted reads and in excel- 
lent agreement with the results from simulated reads (Supplementary Fig. 1c). 
HEK293 RNA-seq after rRNA depletion (RibominusSeq). Total HEK293 RNA 
was isolated using Trizol as recommended by the manufacturer. Ribosomal RNA 
was depleted from total RNA using the Ribominus kit (Invitrogen). A cDNA 
library was generated from rRNA-depleted RNA according to the Illumina 
RNA-seq protocol. The cDNA library was sequenced on an Illumina GAIIx by 
a2 X 76 bp run. 

C. elegans oocyte isolation. Oocytes were isolated from worms carrying a tem- 
perature-sensitive (TS) allele for fem-1 (unovulated oocytes BA17[fem-1(hc17ts)] 
strain) and spe-9 (partially ovulated oocytes BA671[spe-9(hc88ts)]) as described 
previously**. Oocytes were washed at least four times in PBS containing protease 
inhibitors (Sigma-Aldrich) to separate from worm debris. Oocyte purity was 
observed under the dissection scope (Zeiss). Oocytes were extracted from young 
adults to enrich for non-endomitotic oocytes, which was also checked by fluo- 
rescence microscopy (Zeiss) with a nuclear dye. Oocytes isolated from fem-1 or 
spe-9 mutant background worms are hereafter referred to as fem-1 oocytes and 
spe-9 oocytes, respectively. 

C. elegans sperm isolation. Sperm was isolated in principle as described prev- 
iously™* from male worms obtained from a fog-2(q71) mutant background. Males 
were cut in cold PBS containing protease inhibitors (Sigma-Aldrich). Sperm was 
subsequently purified by filtration (3 X 401m nylon mesh, 2 X 101m nylon 
mesh) and a series of differential centrifugations (30 min 300g, 10 min 450g) 
and washed twice in cold PBS. Sperm was subsequently activated by incubation 
in PBS containing 200 zgml* Pronase (Sigma-Aldrich) for 30 min at 25°C. 
Sperm purity is around 70% spermatids and spermatozoa contaminated with 
around 30% primary and secondary spermatocytes, as observed under oil immer- 
sion microscope. 

C. elegans isolation of 1-cell- and 2-cell-stage embryos. 1-cell and 2-cell-stage 
embryos were obtained by fluorescence-activated cell sorting as described previ- 
ously**. Microscopic examination of the sorted embryos indicated that the 1-cell- 
stage sample was virtually pure (>98% one-cell stage embryos), whereas the 
2-cell-stage embryo sample was a mixture of 1-cell-stage (40%), 2-cell-stage 
(55%) and older (<5%) embryos. Moreover, purity of the stages was further 
validated by checking for marker gene expression. 

Ribominus RNA preparation from C. elegans samples. We used a kit that was 
developed for human and mouse samples, but still performs sufficiently to enrich 
mRNAs up to 30% in C. elegans. Most of the remaining reads mapped to ribo- 
somal RNAs. 1 1g of total RNA per sample was depleted from rRNAs with the 
Ribominus Transcriptome kit (Invitrogen) according to the manufacturer’s 
instructions with the modification that annealing of LNA probes to total RNA 
was performed in a thermocycler (Eppendorf) with a temperature decrease from 
70 to 37 °Cata rate of 1 °C per min. Depletion of rRNAs was validated by capillary 
gel electrophoresis on a Bioanalyzer (Agilent). The ribominus RNA was then 
processed for sequencing library preparation according to the Illumina protocol. 
Cluster generation and sequencing of C. elegans libraries. Cluster generation as 
well as sequencing of the prepared libraries was performed on the Illumina cluster 
station (Illumina) and sequenced on the HiSeq2000 according to the manufac- 
turer’s protocols (Illumina). 

Human gene models. We obtained gene models for RefSeq transcripts (12 
December 2011), non-coding RNAs*”** and the rnaGene and tRNA tracks from 
the UCSC table browser (23 April 2012)”. 

Intersection of circRNAs with known transcripts. Our computational screen 
identifies only the splice sites that lead to circularization but not the internal exon/ 
intron structure of circular RNAs. To perform analyses of the sequence content of 
circRNAs we therefore inferred as much as possible from annotated transcripts. 
The conservative assumption was that as little as possible should be spliced out. 
On the other hand, coincidence of circRNA splice sites with exonic boundaries 
inside a transcript were considered as an indicator for relevant agreement and 
internal introns appear to be spliced out (Supplementary Fig. 2e). We therefore 
sorted all overlapping transcripts hierarchically by (1) splice-site coincidence 
(2, 1, or 0); (2) total amount of exonic sequence between the splice sites; (3) total 
amount of coding sequence. The latter was used to break ties only and helped the 
annotation process. If one or both splice sites fell into an exon of the best match- 
ing transcript, the corresponding exon boundary was trimmed. Likewise, if it fell 
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into an intron or beyond transcript bounds, the closest exon was extended to 
match the circRNA boundaries. circRNA start/end coordinates were never 
altered. If no annotated exons overlapped the circRNA we assumed a single-exon 
circRNA. The resulting annotation of circRNAs is based on the best matching 
transcript and may in some cases not represent the ideal choice. Changing the 
annotation rules, however, did not substantially change the numbers in Fig. 1d. 
Finding circRNAs conserved between human and mouse. We reasoned that 
when comparing two species, the cutoff of two independent reads in each of them 
could be dropped, as orthologous circRNAs would automatically be supported by 
two independently produced reads via the intersection. We therefore mapped all 
mouse circRNA candidates with less stringent filtering to human genome coor- 
dinates using the UCSC liftOver tool’’. The mapped mouse circRNAs were com- 
pared with independently identified human circRNAs, yielding 229 circRNAs 
with precisely orthologous splice sites between human and mouse. Of these, 223 
were composed exclusively of coding exons and were subsequently used for our 
conservation analysis (Fig. 1f). When intersecting the reported sets of circRNAs 
supported by two independent reads in each species, we found 81 conserved 
circRNAs (supported by at least 4 reads in total). 

Conserved element counting. We downloaded genome-wide human (hg19) 
phyloP conservation score™ tracks derived from genome alignments of placental 
mammals from UCSC”. We interrogated the genome-wide profile inside 
circRNAs in two different ways. (1) Intergenic and intronic circRNAs. We read 
out the conservation scores along the complete circRNA and searched for blocks 
of at least 6-nucleotide length that exceeded a conservation score of 0.3 for 
intergenic and 0.5 for intronic circRNAs. The different cutoffs empirically adjust 
for the different background levels of conservation and were also used on the 
respective controls. For each circRNA, we computed the cumulative length of all 
such blocks and normalized it by the genomic length of the circRNA. Artefacts of 
constant positive conservation scores in the phyloP profile, apparently caused by 
missing alignment data, were removed with an entropy filter (this did not qua- 
litatively affect the results). circRNAs annotated as intronic by the best-match 
procedure explained above that had any overlap with exons in alternative tran- 
scripts on either strand (five cases) were removed from the analysis. The resulting 
distributions are shown in Supplementary Fig. 1h, i. (2) Coding exon circRNAs. 
We used the best-match strategy outlined above to construct an estimated ‘exon- 
chain’ for the circRNAs that overlapped exclusively coding sequence. Using this 
chain we in silico ‘spliced’ out the corresponding blocks of the conservation score 
profile. We kept track of the frame and sorted the conservation scores into 
separate bins for each codon position. In addition to this, we also recorded 
conservation scores in the remaining pieces of coding sequence (‘outside’ the 
circRNA) as a control. However, we observed that the level of conservation is 
systematically different between internal parts of the coding sequence and the 
amino- or carboxy-terminal parts (not shown). We therefore randomly generated 
chains of internal exons, mimicking the exon-number distribution of real 
circRNAs, as a control. When analysing the circRNAs conserved between human 
and mouse, it became furthermore apparent that we also needed to adjust for the 
higher level of overall conservation. High expression generally correlates with 
conservation and thus, an expression cutoff was enforced on the transcripts used 
to generate random controls. This resulted in a good to conservative match with 
the actual circRNAs (Supplementary Fig. 1j, k). 

Overlap of identified circRNAs with published circular RNAs. A number of 
studies in human have reported evidence for circRNAs which derive from exons 
of DCC’, ETS1° and a non-coding RNA from the human INK4/ARF locus® and 
the CDRlas locus’. Additionally, circRNAs from exons of the genes CAMSAP1, 
FBXW4, MANIA2, REXO4, RNF220 and ZKSCAN1 have been recently experi- 
mentally validated’. For the four genes from the latter study, where we had 
ribominus data from the tissues in which these circRNAs were predicted (leuko- 
cytes), we recovered validated circRNAs from all of them (ZKSCAN1, CAMSAP1, 
FBXW4, MANIA2). 

Cell culture and treatments. HEK293 (Fig. 3f), HEK293TN (for virus production) 
and HEK293 Flp-In T-REx 293 (Life Technologies, all other experiments) were 
cultured in Dulbecco’s modified Eagle medium GlutaMax (Gibco) 4.5 gl glucose, 
supplemented with 10% FCS, 20U ml”? penicillin and streptomycin (Gibco) at 
37 °C, 5% CO . Whereas CDRlas/GAPDH ratios were within the given range, we 
observed two- to fivefold variation of CDRlas/vinculin ratios between different 
HEK lines. Transcription was blocked by adding 2,1gml~’ actinomycin D or 
DMSO asa control (Sigma-Aldrich) to the cell culture medium. For in vitro wound 
healing assays, cells were grown to confluency, the cell layer was disrupted using a 
300 pil pipette tip and cells were washed once with medium. Bright-field images of 
cells were taken using a Axio Observer.Z1 (Zeiss) right after setting the scratch and 
24h later. The relative scratch areas were measured using Image] software. 
Quantitative PCR. Total RNA from cell lines was isolated using Trizol (Invi- 
trogen) extraction following the manufacturer’s protocol. Adult B6129SF1/J mice 


were dissected and tissue samples were collected directly into ice-cold Trizol for 
RNA preparation. Caenorhabditis elegans RNA was isolated from about 7,000 
mixed stage worms by two rounds of freeze-thaw lysis in Trizol LS reagent 
(Invitrogen) according to the manufacturer’s protocol. RNA was extracted from 
aqueous phase with phenol:chloroform (Ambion). RNA was precipitated with 
isopropanol and Glycoblue (Ambion) overnight at —20 °C or for 30 min at — 80 °C, 
respectively. Reverse transcription was performed using M-MLV (Promega) or 
Superscript III with oligo(dT) primer (all Invitrogen) or random primer (Meta- 
bion). For assaying mRNA expression level, RT-PCR was performed using SYBR- 
Green Fluorescein (Thermo Scientific, Fermentas) and a StepOnePlus PCR System 
(Applied Biosystems). Expression data in CDRlas knockdown experiments, tran- 
scriptional block and RNase R assays were normalized to C. elegans spike-in RNA. 
To this end 5-10% C. elegans total RNA was added to the respective Trizol sample 
and qPCR primer for ama-1 or eif-3.d were used. Mouse expression data were 
normalized to Actb. miRNA expression levels were assayed using TaqMan 
microRNA assays (Applied Biosystems) and normalized to sno-234. Expression 
levels of circRNAs described in this study were measured by qPCR using divergent 
primers. A list of primer sequences is available in Supplementary Table 8. 

PCR amplification and Sanger sequencing. DNA templates were PCR amplified 
using BioRad Mastercyclers and ThermoScientific DreamTaq Green PCR Master 
Mix according to the manufacturer’s protocol. We performed 35 cycles of PCR. 
PCR products were visualized after electrophoresis in 2% ethidium bromide- 
stained agarose gel. To confirm the PCR results, the PCR products were purified 
through Agencourt AMPure XP PCR purification kit. Direct PCR product Sanger 
sequencing was performed by LGC Genomics Ready2 Run services. Primer P1 
was provided for sequencing the product for each candidate. 

Primer design. Divergent primers were designed for each candidate (P1, P2) to 
anneal at the distal ends of its sequence. As negative controls we used divergent 
primers for GAPDH and ACTB linear transcript in HEK293 cells, and eIF-3.D in 
C. elegans. As a further negative control for divergent primers, we used genomic 
DNA extracted through Qiagen DNeasy Blood & Tissue kit. As positive controls, 
we used convergent primers for the corresponding linear transcripts or for house- 
keeping genes (eIF-3.D for C. elegans). 

RNase R treatment. HEK293 DNase-treated total RNA (5 |g) was incubated 
15min at 37°C with or without 3Upg ' of RNase R (Epicentre Bio- 
technologies). RNA was subsequently purified by phenol-chloroform extraction, 
retro-transcribed through Superscript SSIII (Invitrogen) according to the man- 
ufacturer’s protocol, and used in qPCR. 

RNA nicking assay. For partial alkaline hydrolysis (nicking) 1pgpl' of 
HEK293 total RNA was incubated in 50 mM NaHCO; for 2.5 or 5 min at 90 °C 
or 5min on ice for controls. After incubation the samples were immediately re- 
suspended in denaturing RNA sample buffer and analysed on northern blots. 
Northern blotting. Total RNA (10-20 1g) was loaded on a 1.2% agarose gel 
containing 1% formaldehyde and run for 2-2.5 h in MOPS buffer. 

The gel was soaked in 1X TBE for 20 min and transferred to a Hybond-N* 

membrane (GE Healthcare) for 1h (15 V) using a semi-dry blotting system (Bio- 
Rad). Membranes were dried and ultraviolet-crosslinked (at 265nm) 1X at 
200,000 LJ cm”. Pre-hybridization was done at 42°C for 1h and 2D_labelled 
oligonucleotide DNA probes were hybridized overnight. The membranes were 
washed briefly in 2 SSC, 0.1% SDS at room temperature and two additional 
times at 55 °C for 30 min, followed by two 30-min washes in 0.2 SSC, 0.1% SDS 
at 50-55 °C. For data collection, the membrane was exposed to a phosphoimager 
screen. 
Genome alignments for detecting miRNA seed complementary sites. Multiple 
species alignments for the genomic intervals, corresponding to circRNAs pre- 
dicted in C. elegans (ce6), human (hg19) and mouse (mm49), were generated via 
the Galaxy server at UCSC**'. In case that a circle was overlapping with an 
annotated transcript, the inferred spliced sequence was used for retrieving the 
alignments. 

The alignments included C. elegans, C. briggsae and C. remanei in the first case 
and Homo sapiens, Mus musculus, Rattus norvegicus, Bos taurus and Canis famili- 
aris in the latter two. 

C. elegans human and mouse miRNAs. Fasta files with C. elegans, human and 
mouse miRNAs were obtained from miRBase release 16 (ref. 62). Only mature 
miRNAs were considered for the seed analysis. According to miRBase 16 a 
mature miRNA is the predominant miRNA between the two species arising from 
the two arms of the precursor hairpin (information that is not included in more 
recent versions).The miRNAs were grouped into families that share a common 
seed (nucleotides 2-7). There are 117, 751 and 723 miRNA families for C. elegans, 
human and mouse, respectively. 

Detecting putative miRNA seed matches. The C. elegans, human and mouse 
multiple species alignments were scanned for putative conserved miRNA target 
sites for each of the mature miRNA families. A putative target site of amiRNA is a 
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6-nucleotide-long sequence in the genome that is the reverse complement of 
nucleotides 2-7 of the mature miRNA sequence. A putative target site is called 
conserved if it is found in C. elegans, C. briggsae and C. remanei in the first case or 
in human, mouse, rat, cow and dog in the latter. 

AGO PAR-CLIP. Generation and growth conditions of human embryonic kid- 
ney (HEK) 293 cells and HEK293 stably expressing Flag/HA-AGO1 and Flag/ 
HA-AGO2? were reported previously”. Stably transfected and parental HEK293 
cells were labelled with 100 1M 4-thiouridine for 16h. After labelling, procedure 
followed the PAR-CLIP protocol as described*’. Briefly, ultraviolet-irradiated 
cells were lysed in NP-40 lysis buffer. Immunoprecipitation was carried out with 
protein G magnetic beads (Invitrogen) coupled to anti-Flag antibody (Sigma) and 
to anti-AGO2 antibody™ from extracts of stably transfected and parental 
HEK293 cells, respectively, for 1 h at 4°C. Beads were treated with calf intestinal 
phosphatase (NEB) and radioactively end-labelled by T4 polynucleotide kinase 
(Fermentas). The crosslinked protein-RNA complexes were resolved on 4-12% 
NuPAGE gel (Invitrogen), and a labelled protein-RNA complex of close to 
100 kDa was excised. The protein-RNA was isolated by electroelution. RNA 
was isolated by proteinase K treatment and phenol-chloroform extraction, 
reverse transcribed and PCR-amplified. The amplified cDNA was sequenced 
on a GAIIx (Illumina) with 36 cycles. 

Human Argonaute PAR-CLIP analysis. We obtained Argonaute PAR-CLIP 
reads from ref. 32. We additionally produced 4 PAR-CLIP libraries. In total, 
we analysed the following PAR-CLIP data sets: AGO1_4su_1 (SRR048973), 
AGO3_4su_1 (SRR048976) from ref. 32; AGO1_4su_LML_MM_6, AGO2_ 
4su_ML_MM_7, AGO2_4su-ML_MM_8, and AGO2_4su_3_ML_LG (our 
own data, published under GEO accession GSE43574). 

Redundant reads were collapsed (such that each distinct read sequence appears 
only once), aligned to the human genome (assembly hg19) using bwa 0.6.1-r104 
(ref. 65), and analysed by our in-house PAR-CLIP analysis pipeline (Jens, M. 
et al., unpublished), essentially as described in ref. 33. Briefly, reads uniquely 
aligning to the genome are grouped into clusters contiguously covering the ref- 
erence, assigning each cluster a number of quality scores (T conversions, number 
of independent reads, etc). Clusters with less than 3 reads from 3 of 6 independent 
AGO PAR-CLIP libraries or lacking T conversions were discarded. Remaining 
clusters are annotated against a comprehensive list of transcript models (see 
below) and collected into ‘only sense’, ‘only antisense’ and ‘intergenic/overlapping 
transcription’ categories based on their annotation. As PAR-CLIP sequencing 
preserves the directionality of RNA fragments we assume ‘only antisense’ clusters 
to predominantly represent false positives due to mapping artefacts (PAR-CLIP 
RNA is mutated and fragments are often short), and choose quality cutoffs for all 
clusters such that the fraction of kept ‘only antisense’ clusters is reduced to below 
5%. Remaining ‘only antisense’ clusters were discarded. For Fig. 3a, uniquely 
aligning, collapsed reads are shown. 

AGO binding sites in C. elegans. Sequencing reads from the Zisoulis Alg-1 
HITS-Clip data were obtained from http://yeolab.ucsd.edu/yeolab/Papers_files/ 
ALG1_MT-WT_raw.tar.gz (ref. 66). The raw sequencing data of the wild-type 
Alg-1 HITS-CLIP was pre-processed and mapped with the mapper module from 
miRDeep2 (ref. 74). The pre-processed reads were mapped with bowtie version 
0.12.7 (ref. 67) to the C. elegans genome (ce6). All reads that overlapped when 
mapped to the genome were merged into bigger regions (islands). Read counts 
were averaged. This resulted in 24,910 islands in the C. elegans genome. 
Analysis of sequence conservation in CDRlas. Genome alignments of 32 verte- 
brates were downloaded from the UCSC database (hg19)” and analysed for the 
CDRIas locus. Primate species other than human were discarded to not bias the 
analyses. The one species (cow) with more than 50% gaps in the CDR1as locus 
was also discarded. The alignments for the seed regions were then corrected. 
Specifically, bases that would clearly align with the seed but had been separated 
in the alignment by runs of gaps were re-aligned. These corrections were neces- 
sary in less than 1% of all seed sites. 

For an in-depth analysis we BLATed® the human CDR 1as sequence with 20- 
nucleotide flanking region against all vertebrate genomes in the UCSC genome 
browser and kept only hits that in turn aligned best to the human locus. The 
resulting sequences were used to build a multiple species alignment with 
MUSCLE”. The same corrections were applied as described above. This align- 
ment was also used for Supplementary Fig. 4. Entropy was calculated in log, units 
and averaged across all alignment columns bracketing each human seed site by 
maximally 8 nucleotides. 

Analysis of miR-7 base-pairing within CDRlas. RNAcofold” was used to co- 
fold miR-7 with each of the 74 binding regions within CDRlas defined as the 
miR-7 seed match TCTTCC and the next 16 bases upstream. 

Single-molecule RNA fluorescence in situ hybridization (smRNA FISH). 48 
oligonucleotide probes (20 nucleotides length; spacing 2 nucleotides) comple- 
mentary to the CDRlas transcript were designed using the Stellaris Probe 
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Designer version 2.0 (Biosearch Technologies) with a masking level of 4 on the 
human genome to achieve high probe specificity (Supplementary Table 8). 
Stellaris probe pools were obtained from BioCat GmbH as conjugates coupled 
to Quasar 670 (a Cy 5 replacement). Flp-In T-REx 293 cells (Life Technologies) 
were grown exponentially and seeded into LabTek 4-well chambered coverslips 
(1 to 2X 10° cells per well). Hybridizations were performed according to the 
manufacturer’s instructions with 50ng ml! DAPI as nuclear counterstain; 
Stellaris probes were hybridized at 125nM concentration with a stringency of 
10% formamide in overnight hybridizations at 37 °C. Images were acquired on an 
inverted Nikon Ti microscope with a Hamatsu ORCA R2 CCD camera, a 60 
NA 1.4 oil objective and Nikon NIS-Elements Ar software (version 4), using an 
exposure time of 50 ms for DAPI and 1-1.5s for Quasar 670. Groups of cells for 
imaging were chosen in the DAPI channel; Z-stacks were acquired in the Quasar 
670 channel using 0.3 um spacing and comprised a total depth of 6.5 ym (5 um 
below and 1.5 ttm above the middle of the nucleus) and merged using maximum 
intensity. 

Mouse strains and in situ hybridizations. All mice were bred and maintained in 
the animal facility of the Max Delbriick Centrum under specific pathogen-free 
conditions, in plastic cages with regular chow and water ad libitum. All aspects of 
animal care and experimental protocols were approved by the Berlin Animal 
Review Board (REG 0441/09). B6129SF1/J wild-type adult, newborns (postnatal 
day 1) or pregnant females (plug detection at day 0.5; embryo collection at day 
13.5) were used, as indicated for each experiment, to obtain the tissues needed for 
RNA analysis and in situ hybridizations (ISH). After death, embryos or tissues 
were immediately frozen in liquid nitrogen and stored at —70 °C, or fixed for ISH. 

Mouse brain structures were collected and named according to the anatomical 
guidelines of the Gene Expression Nervous System Atlas of the Rockefeller 
University (http://www.gensat.org) and the Mouse Brain Atlas (http://www. 
mbl.org/mbl_main/atlas.html). 

For the RNA analysis and to clone CDRlas-specific RNA probes, two adult 
1-year-old mice of both sexes were dissected, total RNA prepared and analysed. If 
embryos or newborns were sectioned, a minimum of two specimens were eval- 
uated; in some instances up to 5 specimens were used. 

For ISH, samples were fixed in formalin (1X PBS; 4% formaldehyde) for 12h 
and post-fixed (70% ethanol, 18 h) before dehydrating and paraffin-embedding. 
Next, the organs were perfused with a standard protocol using a Shandon XP 
Hypercentre. For ISH mouse embryos or organs were cut in RNase-free condi- 
tions at 6 tm and ISH was performed as described with digoxigenin (DIG)- 
labelled RNA probes. All DIG-RNA probes were hybridized at 58 °C overnight. A 
total of 600 ng of the labelled probes was used per slide. 

To amplify Cdr1 sense and antisense sequences for ISH probe preparation a 
standard PCR-amplification was performed using mouse cerebellum cDNA. 
Three Cdrlas amplicons were generated, two of which probes are meant for 
the detection of both linear and circular forms using mmuCdrl_lf 5’- 
TGCCAGTACCAAGGTCTTCC-3’ and mmuCdrl_lIr 5’-TTTTCTGCTGGA 
AGATGTCAA-3’, as well as mmuCdrl1_2f 5’-CCAGACAATCGTGATCT 
TCC-3’ and mmuCdr1_2r 5'-ATCTTGGCTGGAAGACTTGG-3’. In addition 
a probe was generated, specific to the circular probe, using the divergent primers 
mmuCdrl_as_7f 5’-CCACATCTTCCAGCATCTTT-3’ and mmuCdrl_as_ 
7r 5'-TGGATCCCTTGGAAGACAAA-3’ (CDR1_as head to tail probe). All 
ensuing fragments were subcloned into pCR-BluntII-TOPO (Invitrogen) and 
verified by sequencing. Linearized plasmids were amenable for in vitro transcrip- 
tion using the T7 (antisense) or SP6 (sense) polymerase and a DIG-label nucleo- 
tide mixture according to manufacturer’s instruction (Roche Applied Science). 

LNA ISHs were performed according to a protocol suggested by the manufac- 
turer (Exiqon) with minor modifications. For individual LNAs, specific protocols 
were run at 51°C (miR-7; 38485-15) or 58°C (miR-124; 88066-15) on an 
InsituPro VS robot (Intavis). A pre-hybridization step was added, which con- 
sisted of an incubation of the slides at 15 °C lower than the hybridization tem- 
perature for 30min using hybridization buffer. The antibody-blocking step 
was performed in the presence of 1% mouse blocking reagent (Roche 
11096176001) and 10% sheep serum. The LNA probes were used at the following 
concentrations: miR-7 40nM; miR-124 20nM; U6 snRNA 1nM; scrambled 
40 nM, as suggested by miRCURY LNA microRNA ISH Optimization kit 
(Exiqon; 90004). Before detection all slides were washed 4X in NTMT including 
1mM Levamisole. The doubly DIG-labelled LNAs were detected by the alkaline 
phosphatase using the substrate BM-purple (Roche; 11442074001) at 37 °C. 
siRNA- and shRNA-mediated knock down. CDRlas was knocked down using 
custom designed siRNA oligonucleotides (Sigma) and Lipofectamine RNAiMax 
(Invitrogen). 2 10° HEK293 cells were transfected with 10nM siRNA duplex 
following the manufacturer’s protocol. After 12-16h cells were harvested and 
subjected to RNA analysis. For stable knock down of CDRlas, 293TN cells were 
co-transfected with the packaging plasmids pLP1, pLP2 and the VSV-G plasmid 
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(Invitrogen) and pSicoR constructs’! (sequences available in the Supplementary 
Table 8) by calcium phosphate transfections. Viral supernatants were harvested 
after 24h and 48h post transfection and filtered through a 0.44 1m filter. For 
infection the viral supernatants supplemented with fresh medium and 6 jg ml! 
polybrene was added to target cells. After overnight infection cells were allowed to 
recover for 12h and subjected to a second round of infection. Cells were collected 
48-72 h after the first infection. The list of siRNA oligonucleotides is provided in 
Supplementary Table 8. 

Zebrafish methods. Zebrafish and their embryos were handled according to 
standard protocols” and in accordance with Max Delbriick Centrum insti- 
tutional ethical guidelines. The Tg(huC:egfp) and the Tg(Xia. Tubb:dsRED) trans- 
genic zebrafish lines have been described elsewhere*”*. Morpholino antisense 
oligomers (Gene Tools) were prepared at a stock concentration of 1 mM accord- 
ing to the manufacturer’s protocol. Sequences: control morpholino, 5’-CTC 
TTACCTCAGTTACAATTTATA-3’ (control morpholino) and morpholino tar- 
geting miR-7, 5’-ACAACAAAATCACAAGTCTTCCACA-3’ (miR-7 morpho- 
lino). For titration experiments we used 15 ng of control morpholino and 9 and 
15 ng of miR-7 morpholino; for all other experiments we used 9 ng miR-7 mor- 
pholino. 3 nl of morpholinos were injected into the yolk of single-cell-stage 
embryos. 

A 673-nucleotide mouse Cdrlas fragment was amplified from mouse cerebel- 
lar cDNA and the amplicon was subcloned into a pCR-Blunt II Topo vector 
(Invitrogen). The vector was linearized with KpnI or Apal (Fermentas) in vitro 
transcribed (IVT) using T7 and SP6 RNA polymerases (Promega) and the result- 
ing Cdrlas and reverse complement Cdrlas_control products were used for 
injections (1.5nl of 100 ngnl *) into the cell of single-cell-stage embryos. In a 
repetition of these experiments the Cdrlas fragment amplicon was directly used 
as a template for IVT by exploiting T7-promoter extended forward and reverse 
primer. 

Approximately 1.5 nl of a 50 ng pl _' construct (backbone pCS2+) expressing 
the human linear or the human circular CDRlas was injected into the cell of 
single-cell-stage embryos (provided by the Kjems laboratory). For rescue experi- 
ments the construct containing the circular CDRlas was injected together with 
1.5 nl pre-miR-7 precursor (7 |.M, pre-miR miRNA precursor ID PM10047 from 
Applied Biosystems). The negative control was the vector pCS2+ without insert 
(empty vector, 50 ng jl’). 

Confocal imaging was performed using a Zeiss LSM 510 microscope (Carl 
Zeiss MicroImaging) equipped with a 25x objective (NA = 0.8). Embryos were 
anaesthetized using 0.1% tricaine and mounted in 1% agarose as described”. 
Confocal stacks were acquired of the brain region and processed using Zeiss 
ZEN software. Midbrain and telencephalon volumes were calculated using 
Imaris 64X7.6.1 software based on high-resolution three-dimensional stacks 
obtained from Tg(Xia.Tubb:dsRED) embryos. Reduced midbrain development 
was defined as >50% smaller than the mean size of controls. 

Each experimental group was evaluated in at least three independent experi- 
ments; a minimum of 80 individual embryos per group were examined. Data are 
expressed as mean + standard deviation. Statistical analysis was performed using 
Students’t-test, and a P< 0.05 was considered statistically significant. 

Expression of miR-7 in zebrafish embryos at 48 hours post fertilization was 
normalized to expression of B-actin. In the miR-7 morpholino group, only 
embryos with a midbrain phenotype were used for the RNA expression analysis. 


dre B-actin forward primer, 5’-TGCTGTTTTCCCCTCCATTG-3’; reverse pri- 
mer, 5’-TTCTGTCCCATGCCAACCA-3’; probe sequence FAM-5’-TGGAC 
GACCCAGACATCAGGGAGTG-3'-TAMRA. 

For measuring the expression of dre-miR-7a/b we used Applied Biosystems 
TaqMan miR assays (1D000268, ID001088). 
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ATPase-dependent quality control of 
DNA replication origin licensing 


Jordi Frigola’, Dirk Remus'+, Amina Mehanna! & John F. X. Diffley! 


The regulated loading of the Mcm2-7 DNA helicase (comprising six related subunits, Mcm2 to Mcm/7) into pre-replicative 
complexes at multiple replication origins ensures precise once per cell cycle replication in eukaryotic cells. The origin 
recognition complex (ORC), Cdc6 and Cdtl load Mcm2-7 into a double hexamer bound around duplex DNA in an 
ATP-dependent reaction, but the molecular mechanism of this origin ‘licensing’ is still poorly understood. Here we 
show that both Mcm2-7 hexamers in Saccharomyces cerevisiae are recruited to origins by an essential, conserved 
carboxy-terminal domain of Mcm3 that interacts with and stimulates the ATPase activity of ORC-Cdc6o. ATP 
hydrolysis can promote Mcm2-7 loading, but can also promote Mcm2-7 release if components are missing or if ORC 
has been inactivated by cyclin-dependent kinase phosphorylation. Our work provides new insights into how origins are 
licensed and reveals a novel ATPase-dependent mechanism contributing to precise once per cell cycle replication. 


Stable genome inheritance requires that replication origins initiate 
efficiently during S phase, and that re-initiation from these origins 
is subsequently prevented. This is accomplished by first licensing 
origins during G1 phase with a pre-replicative complex containing 
an inactive double hexamer of the Mcm2-7 helicase, and then activat- 
ing the helicase during S phase, when pre-replicative complexes can 
no longer be assembled’ ®. 


Mcm3 recruits Cdtl-Mcm2-7 to ORC-Cdc6 


We have previously described a system in which pre-replicative com- 
plexes can be assembled on DNA with purified proteins’. When ATP 
is present, ORC and Cdcé6 load Cdtl-Mcm2-7 heptamers into a salt- 
resistant Mcm2-7 double hexamer with concomitant release of Cdc6 
and Cdt1; in ATP-yS (a slowly hydrolysed analogue of ATP) all pre- 
replicative complex components are recruited in low salt but are 
removed by a high-salt wash (HSW) (Supplementary Fig. la). To 
investigate the individual roles of Cdt1 and Mcm2-7, we purified each 
separately (Supplementary Fig. lb-d) and showed each protein was 
functional for loading when re-assembled into a Cdtl1-Mcm2-7 com- 
plex (Supplementary Fig. 2). Consistent with our previous results’, 
none of the pre-replicative complex components bound to DNA 
under recruitment conditions (ATP-yS, low salt) in the absence of 
ORC (Fig. 1A, lanes 1-6). In the absence of Cdc6 (lanes 7-9), ORC 
bound DNA, but could not recruit Cdtl, Mcm2-7 or Cdtl-Mcm2-7. 
Cdc6 was recruited to DNA in an ORC-dependent manner (compare 
lanes 4 and 10) consistent with the formation of an essential ORC- 
Cdc6 complex'®. Cdt1 was not recruited in the presence of ORC-Cdc6 
(lane 10), but we found significant ORC- and Cdc6-dependent 
recruitment of Mcm2-7 subunits in the absence of Cdtl (compare 
lane 11 to lanes 6 and 9). Cdt1 was only recruited when Mcm2-7 was 
present along with ORC and Cdcé6 (lane 12). From this we conclude 
that Mcm2-7 can be recruited to ORC-Cdc6 without Cdt1 and that 
Cdt1 recruitment requires Mcm2-7. 

The absence of Cdt1 did not affect Mcm7 recruitment, but did 
reduce the amount of Mcm2 recruited (Fig. 1A, compare lanes 11 
and 12). To explore this further, we tested for the presence of each 
of the six Mcm2-7 subunits after Mcm2-7 recruitment in ATP-yS in 


the presence or absence of Cdt1 using silver staining (Fig. 1B, a) and 
immunoblotting (Fig. 1B, b). We found that Mcm3, 5 and 7 were 
recruited to ORC-Cdc6 similarly in the presence and absence of 
Cdt1 (Fig. 1B); the recruitment of Mcm2, 4 and 6, however, was 
significantly reduced without Cdt1l. This suggests that Mcm3, 5 and 
7, which are immediate neighbours in the Mcm2-7 ring, are recruited 
to ORC-Cdcé directly without Cdtl, but Cdtl plays some role in 
recruiting Mcm2, 4 and 6 (Fig. 1B, c). 


The extreme C terminus of Mcm3 recruits Cdtl-Mcm2-7 


We took several approaches to address which Mcm2-7 subunits were 
involved in the Cdt1-independent recruitment. We tested each of the 
six Mcm2-7 subunits individually and found that only Mcm3 could 
be recruited in a Cdc6-dependent manner without the other subunits 
(Supplementary Fig. 3a, b, f). We next assembled Mcm2-7 complexes 
containing all subunit or lacking either Mcm3 or Mcm4 (A3 and A4, 
respectively; Supplementary Fig. 3c, d). Mcm5 and 7 were recruited 
to ORC-Cdc6 in the absence of Cdtl by the complete complex 
(Supplementary Fig. 3f, lanes 1 and 5) and by A4 (lanes 3 and 7), 
but not by A3 (lanes 2 and 6). Taken together, these results indicate 
that Mcm3 is critical for recruiting Mcm5 and 7 to ORC-Cdc6 in the 
absence of Cdtl. 

Mcm3 comprises an amino-terminal domain (NTD) anda AAA+ 
domain, which are both found in all Mcm2-7 subunits'!’, and an 
extended C-terminal tail of unknown function containing a conserved 
domain at the extreme C terminus (‘C’ in Fig. 2a) that is not found in 
other Mcm2-7 subunits (Fig. 2a). A version of Mcm3 containing a 
3 Flag tag at its C terminus could be assembled into a stable hepta- 
meric Cdtl-Mcm2-7 complex, but was unable to recruit or load 
Mcm2-7 (Supplementary Fig. 4). Moreover, a small fragment con- 
taining the C-terminal 194 amino acids residues of Mcm3 was recruited 
in an ORC- and Cdc6-dependent manner (Fig. 2b). These results 
pointed to the C terminus as being crucial for Mcm3 recruitment. 

Based on the conservation of the Mcm3 C terminus (Fig. 2a), and 
the fact that a tag at the C terminus interfered with its function, we 
generated a series of C domain amino acid substitution mutants in 
full-length Mcm3 (Fig. 2a). We assembled the mutant proteins into 
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Figure 1 | Mcm3 is necessary and sufficient for Mcm2-7 recruitment. 

A, Protein requirements for Mcm2-7 recruitment. ORC, Cdc6, Cdtl and 
Mcm2-7 were purified as described in Methods and in Supplementary Fig. 1. 
After incubation of the indicated proteins with DNA beads in the presence of 
ATP-yS, beads were isolated and washed with low salt. DNA was uncoupled 
with irradiation as described in Methods and bound proteins were analysed by 
immunoblot with the indicated antibodies. B, Recruitment of Mcm2-7 by ORC 
and Cdc6 was performed as in A. Reactions contained Mcm2-7 either with (+) 
or without (—) Cdt1. B, a-c, Bound proteins were visualized by silver staining 
(B, a) or immunoblot (B, b). Panel B, c summarizes the results in B, a, b. 


Cdtl-Mcm2-7 complexes (Supplementary Fig. 5) and tested their 
ability to recruit and load Mcm2-7. The single mutants Mcm3-11 
and Mcm3-12, and the Mcm3-13 double mutant were completely 
defective in recruitment of Cdtl-Mcm2-7 to ORC-Cdc6 (Fig. 2c, 
lanes 2-4), as was Mcm3 lacking its entire C-terminal domain 
(3AC, Fig. 2c, lane 8). In these experiments, Orcl serves as a useful 
loading control. These mutant proteins were also not recruited when 
tested individually in the absence of Cdtl and the other Mcm2-7 
subunits (Supplementary Fig. 6). Consistent with their defect in 
recruitment, Mcm3-11 and Mcm3AC were unable to load Mcm2-7 
(Fig. 2d, lanes 2 and 6). Mutation of other conserved residues 
(Mcm3-16, Mcm3-22 and Mcm3-23) generated proteins that were 
less defective, showing reduced recruitment (Fig. 2c, lanes 5-7 and 
Supplementary Fig. 6) and a commensurate reduction in loading 
(Fig. 2d, lanes 3-5). Therefore, the C terminus of Mcm3 is essential 
for recruitment of all Mcm2-7 subunits. As Mcm2-7 containing 
Mcm3-11, Mcm3-12 or Mcm3-13 is not recruited even when bound 
to Cdtl, this indicates that Cdtl cannot recruit Mcm2-7 to ORC- 
Cdcé directly. The reduced association of Mcm2, 4, 6 with ORC-Cdc6 
(Fig. 1A, B) therefore probably reflects a role for Cdt1 in retaining or 
stabilizing these subunits after initial recruitment by Mcm3. 

We next expressed the mutant Mcm3 proteins in yeast strains 
harbouring a temperature-sensitive degron mutant of MCM3 
(mcm3-td)'*. This degron mutant grew well at 25 °C, but not at the 
restrictive temperature, 37 °C (Fig. 2e, row 1). The growth defect at 
37°C was suppressed by expression of wild-type Mcm3 (Mcm3, 
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row 2), but not by expression of Mcm3-11, Mcm3-12 or Mcm3-13 
(rows 3-5) indicating that these proteins are not functional in vivo. 
Expression of these proteins significantly reduced growth even at 
25°C when the wild-type degron fusion protein was functional 
(middle panel, GAL 25 °C, compare row 1 with rows 3-5), indicating 
that these mutants act as dominant-negative inhibitors when over- 
expressed. Some of the other mutants (Mcm3-16, Mcm3-22, Mcm3- 
23) showed mild growth defects at 37 °C, consistent with their partial 
defects in Mcm2-7 loading. Supplementary Fig. 7 shows that mcm3-11 
supported only very slow growth when present as an only copy, 
whereas mcm3-12 and mcm3-13 did not support growth. Taken 
together, these results indicate that a domain at the extreme C terminus 
of Mcm3 is necessary for recruiting Cdtl/Mcm2-7 to ORC-Cdc6, and 
that this function is essential for viability. 


Both Mcm2-7 hexamers require the C terminus of Mcm3 
The results in Figs 1 and 2 show that recruitment of the first Mcm2-7 
hexamer occurs by interaction of Mcm3 with ORC-Cdc6. If recruit- 
ment of the second hexamer occurred by a different mechanism, one 
that didn’t involve Mcm3-ORC-Cdc6 interaction but did require 
recruitment of the first hexamer, then the Mcm3-11 mutant should 
be able to be recruited as the second hexamer in the presence of wild- 
type Cdtl-Mcm2-7. To test this, we first generated two different 
forms of Mcm3 that could be distinguished after SDS—polyacrylamide 
gel electrophoresis (SDS-PAGE), one containing an N-terminal 
fusion to the Flag tag (N’ 3 x Flag~Mcm3), the other containing a larger 
N-terminal fusion to maltose binding protein (N’ MBP-Mcm3). Both 
proteins supported similar levels of recruitment and loading, and 
both complemented the mcm3-td mutant at 37°C (Supplementary 
Fig. 8). We engineered the Mcm3-11 mutation into either the MBP 
fusion (Fig. 3a) or the Flag fusion (Fig. 3b), generated mutant Cdt1l- 
Mcm2-7 complexes and mixed these complexes in different ratios with 
Cdtl-Mcm2-7 containing either wild-type Flag-Mcm3 (Fig. 3a) or 
wild-type MBP-Mcm3WT (Fig. 3b). These experiments show that the 
Mcm3-11 mutant was not recruited above background levels in ATP or 
ATP-YS over a wide range of wild type:mutant ratios, showing that the 
wild-type protein cannot aid recruitment of the Mcm3-11 mutant 
complex, indicating that recruitment of both hexamers requires the 
Mcm3 C terminus. 


Mcm3 binding activates the ORC-Cdc6 ATPase 


ATP hydrolysis is required for assembly of the Mcm2-7 double hex- 
amer, but what stage of assembly requires ATP hydrolysis is not 
known. We therefore tested whether Mcm3 binding affected the 
ATPase activity of DNA-bound ORC-Cdcé. Neither Mcm3 nor the 
Mcm3-13 mutant showed appreciable ATPase activity by themselves 
(Fig. 4a), consistent with the fact that, asa AAA+ ATPase, Mcm3 
requires an arginine finger from a binding partner (in this case, 
Mcm5) to promote ATP hydrolysis. ORC, together with Cdc6 
(ORC-Cdc6, Fig. 4a), had some ATPase activity in the presence of 
DNA, which might reflect a basal level of ATPase from DNA-bound 
ORC-Cdc6, or might be caused by ORC and/or Cdcé6 that is not 
bound to DNA. Addition of Mcm3, but not Mcm3-13, to ORC- 
Cdc6 resulted in a three- to fourfold increase in ATPase activity, 
indicating that the interaction of the C terminus of Mcm3 stimulates 
the ATPase of ORC-Cdc6. To rule out any contribution from the 
Mcm3 ATPase, we examined the effect of the Mcm3 C-terminal 
fragment lacking the AAA+ domain. Figure 4b shows that this 
domain was as effective as full-length Mcm3 in stimulating the 
ATPase activity of ORC-Cdc6. Reactions in Fig. 4b contained 
2.5 pmol each of ORC and Cdc6, yet addition of Mcm3 resulted in 
approximately 140 pmol of additional ATP hydrolysis (above the level 
produced by ORC and Cdcé6 alone) over 20 min, indicating that 
Mcm3 binding can induce multiple rounds of ATP binding and 
hydrolysis by ORC-Cdcé. 
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Figure 2 | The C terminus of Mcm3 is required 
for Mcm2-7 recruitment. a, The domain 
architecture of Mcm3 and alignment of the Mcm3 
C terminus. The alignment includes Mcm3 from a 
variety of eukaryotic species (Sc, Saccharomyces 
cerevisiae; Kl, Kluyveromyces lactis; Yl, Yarrowia 
lipolytica; Sp, Schizosaccharomyces pombe; Hs, 
Homo sapiens; X1, Xenopus laevis; Dm, Drosophila 
melanogaster). Residue numbers above alignment 
correspond to S. cerevisiae Mcm3. The position of 
various mutants is shown by vertical lines, and the 
mutant amino acid residue is shown at the bottom 
of the line. Allele names are on the right. The 
numbering of alleles begins with 11 to prevent 
confusion with any existing mcm3 mutant alleles. 
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early in the Mcm2-7 loading reaction. Supplementary Fig. 9a shows 
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saw with non-hydrolysable ATP-yS (Supplementary Fig. 3a, f and 
Fig. 5a, ATP-yS) was lost when hydrolysis was allowed (Fig. 5a, 
ATP). Similarly, the retention of Mcm3, Mcm5 and Mcm7 seen in 
ATP-YS in the absence of Cdt1 (for example, Fig. 1) was also lost in the 
presence of ATP (Fig. 5b, compare lanes 2 and 6). These results 
indicate that ATP hydrolysis can lead to release of Mcm3 (and asso- 
ciated subunits) when the loading of incomplete Cdt1-Mcm2-7 com- 
plexes is attempted. 

This could be because the remaining Mcm2-7 subunits cannot 
form stable, topologically closed double hexamers around DNA and 
are, therefore, released during a futile loading reaction with ATP 
hydrolysis. ORC is not required for Mcm2-7 to remain bound topo- 
logically to the DNA after loading (for example, after a high-salt 
wash). So, we next looked at the ability of the full, wild-type Cdtl- 
Mcm2-7 complex to be recruited and loaded by ORC lacking the 
Orc6 subunit (ORCAG), which can still bind DNA. ORCA6 was able 
to recruit all six Mcm2-7 subunits in the presence of ATP-yS as 
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Figure 4 | Mcm3 binding activates the ORC-Cdc6 ATPase. a, b, The 
conversion of [«°?P]ATP to [«°?P]ADP was monitored as described in 
Methods in reactions containing the indicated proteins (Mcm3 wild type; 
Mcm3-13 mutant; Mcm3-CT, a polypeptide containing the C-terminal 194 
amino acid residues of Mcm3; ORC and Cdc6). Error bars depict standard error 
of the mean from five reactions. 
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Figure 5 | ATP hydrolysis by ORC-Cdc6 can promote Mcm2-7 release. 

a, Mcm3 was tested for recruitment (low-salt wash) in reactions containing 
ORC, Cdc6 and either ATP or ATP-7S. Bound proteins were analysed by silver 
staining (silver) or immunoblotting with anti-Mcm3 antibody (immuno). 

b, Recruitment and loading assays were performed as above with ORC, Cdc6 
and Mcm2-7 either with (+) or without (—) Cdtl in ATP or ATP-YS. Proteins 
were visualized by silver staining. c, Complexes containing either the complete 
six-subunit ORC complex (All) or a complex lacking Orc6 (A6), purified as 
described in Methods, were tested for recruitment (low-salt wash) in either 


efficiently as the complete ORC complex (Fig. 5c, lanes 3 and 4). 
Although the recruited complex was slightly more salt-sensitive than 
the complex recruited with the complete ORC complex (Supplemen- 
tary Fig. 9b), the only significant difference between Fig. 5c lanes 3 and 
4 was the absence of Orcé in lane 4. Recruitment of Mcm2-7 subunits 
in the absence of Orcé still required the C terminus of Mcm3, because 
it was completely defective in the Mcm3-11 mutant (Supplementary 
Fig. 9d, lane 4). Notably, although ORCA6 recruited all six Mcm2-7 
subunits in ATP-7S, it was unable to stably recruit any of these sub- 
units in the presence of ATP (compare Fig. 5c, lanes 2 and 4). ORCA6 
and Cdcé6 are bound equally well in ATP and ATP-yS (Supplemen- 
tary Fig. 9a). Thus, although the entire Mcm2-7 ring is efficiently 
recruited, ATP hydrolysis promotes release of Cdtl-Mcm2-7 in the 
absence of Orc6. 

To rule out the possibility that the non-physiological absence of 
Orc6 somehow alters the function of ORC-Cdc6, and to examine the 
role of ATP-dependent Mcm2-7 release in vivo, we examined a 
physiologically relevant situation where ORC has been functionally 
inactivated. Phosphorylation of Orc6, along with Orc2, by cyclin- 
dependent kinase (CDK) plays a role in preventing re-replication'*”® 
by blocking the ability of ORC to load Mcm2-7’”. We phosphorylated 
purified ORC (ORC-P) with the yeast mitotic CDK, Clb2—Cdc28, and 
examined its ability to recruit and load Mcm2-7. ORC-P cannot load 
Mcm2-7 (Fig. 5d, compare lanes 2 and 6), but it can recruit Mcm2-7 
subunits as well as the unphosphorylated ORC in the presence of 
ATP-yS (compare lanes 3 and 7). However, ORC-P was unable to 
accumulate Mcm2-7 subunits in the presence of ATP (compare lanes 
1 and 5). Lambda phosphatase treatment of ORC-P restored its ability 
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ATP or ATP-yS. DNA-bound proteins were analysed by silver staining. 

d, Recruitment and loading assays were performed with unphosphorylated 
ORC (ORC) or CDK-phosphorylated ORC (ORC-P). In lanes 9-12 ORC-P 
was previously dephosphorylated with lambda phosphatase (APPase). e, A 
revised model for the mechanism of origin licensing which includes an ATPase- 
dependent release step based on experiments in this manuscript. Details of 
the model are discussed in the text. The “’ in step (5) refers to the assembly of 
the second hexamer, which is discussed in the text. 


to recruit and load Mcm2-7 in ATP (Fig. 5d, lanes 9-12). These 
results show that, as in the cases of Mcm3 alone, Mcm2-7 without 
Cdt1, and ORCA6, the CDK-phosphorylated ORC is competent to 
recruit Mcm2-7 to ORC-Cdcé6, but ATP hydrolysis then promotes 
Mcm2-7 release. 

To provide additional evidence that ATP hydrolysis can promote 
Mcm2-7 release from ORCA6, we have looked at Mcm2-7 recruit- 
ment in ATP at reduced temperatures, to slow ATP hydrolysis. In 
contrast to 30°C (Fig. 5c), the recruitment of Mcm2-7 by ORCA6 
occurred equally well in ATP and ATP-yS at 8°C (Supplementary 
Fig. 10). However, when these reactions were shifted to 30°C after 
9 min, recruitment of Mcm2-7 continued in ATP-yS but Mcm2-7 
was significantly reduced in the presence of ATP, consistent with the 
idea that elevated temperature promotes ATP hydrolysis and sub- 
sequent Mcm2-7 dissociation. 


Discussion 


Our results support a model for origin licensing wherein Cdtl- 
Mcm2-7 is first recruited to origins by interaction between the C 
terminus of Mcm3 and ORC-Cdc6 (Fig. 5e step 3). Cdtl is not 
required for this initial recruitment, in contrast to previous experi- 
ments in crude extracts’”"”. We suggest, by analogy to other AAA+ 
proteins”, that domain movements in ORC-Cdc6 triggered by Mcm3 
binding activate the ATPase of ORC and/or Cdcé by bringing argi- 
nine fingers in contact with ATP binding sites in adjacent subunits 
(Fig. 5e step 4). We further suggest that these domain movements are 
transmitted to Mcm2-7, leading to destabilization of Mcm2, Mcm4 
and Mcmé6 when Cdtl is missing (Fig. 1A, B). Cdt1 stabilizes Mcm2-7 


©2013 Macmillan Publishers Limited. All rights reserved 


in this new conformation by a mechanism that does not require its 
interaction with Orc6 (Fig. 5c). Because both hexamers must interact 
with ORC-Cdcé6 (Fig. 3), our results indicate they are either loaded 
sequentially or there is more than one Mcm3 binding site in ORC- 
Cdc6, allowing concerted loading. 

When all pre-replicative complex components are present and 
in the correct post-translational modification state, ATP hydrolysis 
promotes loading of the ring around DNA (Fig. 5e step 5). How- 
ever, when criteria for correct licensing are not met, ATP hydrolysis 
instead is coupled to irreversible dissociation of inappropriate protein 
assemblies, preventing the accumulation of non-productive com- 
plexes (Fig. 5e step 6). Cdc6 is bound to ORCAG6 in both ATP and 
ATP-YS (Supplementary Fig. 9e), indicating either that Cdc6 remains 
bound to ORC after Cdtl1-—Mcm2-7 release, or that Cdc6 is released, 
but rebinds quickly (Fig. 5e step 7). This ATPase-dependent quality 
control mechanism contributes to preventing licensing by CDK- 
phosphorylated ORC outside of G1 phase, and therefore has a role 
in ensuring once per cell cycle replication. We note that the block to 
re-replication in metazoans works largely through inhibition and 
degradation of Cdt1*'”’, and our results (Fig. 5b) indicate that this 
will also prevent licensing via this ATPase-dependent mechanism. 
Although a great deal is known about the multiple pathways contrib- 
uting to a robust block to re-assembly of pre-replicative complexes 
after replication in eukaryotes”*, few quality control mechanisms 
contributing to efficient pre-replicative complex assembly during 
G1 phase have been described (see ref. 25). The mechanism we have 
described here may also have a role during Gl phase to ensure 
assembly of complete, functional pre-replicative complexes. 


METHODS SUMMARY 


All strains are listed in Supplementary Table 1 and strain constructions are 
described in Methods. Mcm2-7 loading reactions were performed as described 
(ref. 7 and Methods). After isolation and washing of the beads, DNA was cleaved 
from beads using irradiation with a 330 nm light source, which does not induce 
significant levels of DNA crosslinking (see Methods and Supplementary Fig. 11). 
Proteins were analysed after SDS-PAGE by either silver staining or immunoblot- 
ting with a variety of antibodies (Methods). All proteins were expressed in either 
S. cerevisiae or Escherichia coli and were purified as described in Methods. Cdt1 
and Mcm2-7 expressed in both bacteria and yeast were proficient for loading 
(Supplementary Fig. 12). ATPase assays were based on the method described in 
ref. 26 with the modifications described in Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Cdtl-Mcm2-7 overexpressed in yeast (yJF38). To obtain a yeast strain over- 
expressing the Cdtl-Mcm2-7 complex, four new vectors were generated using 
the pRS30_series”’. Making use of the bidirectional inducible GAL1-10 promoter, 
the six MCM subunits, CDT1 and GAL4 genes were cloned and overexpressed 
in the strain yJF38. GAL4 is a positive regulator of GAL genes in response to 
galactose, it was therefore overexpressed together with the MCM2-7 and CDT1 
genes to increase protein yield. 

GAL4 and MCM2 were amplified using the primer pairs JF1-2 and JF3-4, 
respectively. GAL4 was cloned into pRS303 (pJF2.1) and MCM2 into pRS306 
(pJF5.1) between Smal and BamHI sites. MCM7 and MCM5 were amplified with 
JF5-6 and JF7-8 primer pairs and cloned into a Smal site in pRS304 (pJF4.1) and 
pRS306 (pJF3.1), respectively. The forward primers inserted a unique Ascl site at 
the 5’ end of the start codon of these genes. MCM3, MCM6, MCM4 and CDT1 
were amplified using the following primer pairs: JF9-10, JF11-12, JF13-14 and 
JF15-16. The PCR products were cloned between Spel and Not! sites into pJF5.1, 
4.1, 3.1 and 2.1, respectively, to yield pJF5.2, 4.2, 3.2 and 2.2. In this case, a unique 
SgrAI site was inserted at the 5’ end of their start sites. In addition a 3 x Flag tag”* 
was inserted using a SgrAI site present upstream of MCM3 in pJF5.2 (pJF6.2). 
Finally, the GAL1-10 promoter was cloned between the AscI and SgrAI sites 
previously inserted, yielding pJF2, 3, 4 and 6. These vectors were integrated into 
the strain yJF1 (yJF38). 

Cdtl-Mcm2-7 overexpressed in yeast (yJF35, yJF59 and yJF73). yJF35 and 
yJF59 are analogous to yJF38 except that the 3 x Flag tag on MCM3 was inserted at 
its C terminus in yJF35 and on the C terminus of CDT1 in yJF59. The vectors 
integrated into yJF1 were pJF2, 3, 4 and 5. The C-terminal 3 x Flag on MCM3 and 
CDTI was inserted afterwards by PCR-based tagging using pBP83. pJF2, 3, 4 and 
39 vectors were integrated into the strain yJF1, yielding yJF73. 

Mcm2-7 overexpressed in yeast (yJF39). To obtain a Mcm2-7 complex without 
Cdtl, a strategy similar to the design of yJF38 was followed, except that pJF2 was 
not integrated into the final strain and the endogenous CDT! locus was tandem 
affinity purification (TAP)-tagged”’ using pJF21, thus producing yJF39. 

ORC overexpressed in yeast (ySD-ORC). To overexpress ORC in yeast, we 
modified pJF2, 3 and 5 as follows. In pJF5, MCM2 and MCM3 were replaced 
with calmodulin binding peptide-tobacco etch virus (CBP-TEV)-ORCI and 
-ORC2, respectively (pJF19). In pJF2 GAL4 and CDT1 were substituted with 
ORC3 and ORC4, respectively (pJF17). In pJF3, MCMS5 and MCM4 were replaced 
with ORC5 and ORC6, respectively (pJF18). All ORC genes were codon opti- 
mized. Finally, pJF19, 17 and 18 were integrated into yJF1, producing the strain 
ySD-ORC. 

ORC(A6) overexpressed in yeast (ySD-ORC(A6)). In pJF3, MCM5 was replaced 
with ORCS and MCM4 was deleted, producing the vector pJF20. To obtain ORC 
without Orcé, yJF1 was transformed with pJF17, 19 and pJF20, instead of pJF18. 
Furthermore, a C-terminal 3Flag tag was added to the endogenous copy of 
ORC6, using pBP83. 

MCM3 mutants. MCM3 mutants were cloned into two different vectors: pMAL- 
C2P to purify recombinant proteins, and pRS306 for complementation studies 
in yeast. 

pMAL-C2P was a gift from S. Mochida and was derived from pMAL-C2 (NEB) 
by introducing a PreScission protease site before the EcoRI site in the polylinker 
region’. MCM3 was amplified from S. cerevisiae genomic DNA using AM51 and 
AM52 and cloned into pMAL-C2P using Xbal and Sall sites (pAM5). MCM3 
contains a Kpnl site at position 2690. Mutant fragments of 231 base pairs in 
length, between this site and the stop codon were synthesized by GeneArt. The 
mutated sequences incorporated a Notl, followed by a Sall restriction site at the 3’ 
end of the stop codon. Using KpnI and Sall sites, the mutant sequences were 
cloned into pAM5, to yield pJF27 (Mcm3-11), pJF28 (Mcm3-12), pJF29 (Mcm3- 
13), pJF32 (Mcm3-16), pJF35 (Mcm3-22) and pJF36 (Mcm3-23). 

MCM2 in pJF5 was removed, to yield pGC003. MCM3 mutant sequences were 
excised from pJF27, 28, 29, 32, 35 and 36 vectors using AvrII and Not! sites, and 
cloned into pGC003 (pJF8, 9, 10, 13, 15 and 16). Mcm2 in pJF6 was removed, to 
yield pJF37. MBP tag was amplified from pMAL-C2P using the primers JF19 and 
20 and cloned into pGC003 using SgrAI unique site, originating pJF38. Finally, 
these vectors were integrated into a MCM3 degron strain (yKL43’*), producing 
the strains yJF63, 64, 65, 66, 69, 71, 72, 78 and 79. 

MCM3 mutants in a diploid background. Wild-type C-terminal Mcm3 was 
amplified from pGC003, C-terminal Mcm3 mutants were amplified from 
pJF27 (Mcm3-11), pJF28 (Mcm3-12) and pJF29 (Mcm3-13) using primers 
JF17 and JF18. The PCR products contained the URA3 gene and were used to 
transform the diploid strain W303 (all amplified products were sequenced before 
transformation). Transformants were selected for on media lacking uracil. These 
were shown by PCR to be heterozygous for replacement of the C terminus of 


Mcm3 with either wild-type or mutant Mcm3 cassettes as above. Sporulation and 
tetrad dissection of these transformants was performed to examine their viability. 
Fragments of MCM3. A C-terminal fragment of MCM3 (588 base pairs) was 
amplified from S. cerevisiae genomic DNA using AM54 and AM52. This PCR 
product was cloned in pMAL-C2P using Xbal and Sall sites (pAM6). An 
N-terminal fragment of MCM3 (2,331 bp) was amplified from S. cerevisiae geno- 
mic DNA using AM51 and AMS53. This PCR product was cloned in pMAL-C2P 
using Xbal and Sall sites (pAM7). 

Individual MCM subunits expressed in bacteria. MCM4 and 5 were amplified 
by PCR using NC1-2 and NC3-4 primer pairs. These PCR products were cloned 
between NdeI and Xhol restriction sites into pET22b, to yield pET22b-MCM4 
and pET22b-MCM5. MCM6 was excised from pET16b-MCM6 using NdeI and 
BamHI” and cloned into pET11a (pET11a-MCM6). 

Cdt1 expression in bacteria. CDT1 was amplified from S. cerevisiae genomic 
DNA using NC5 and NC6 and cloned into pGEX-6p-1 (GE Healthcare), using 
BamHI and Not! sites (pGEX-6p-1-CDT1). 

Cdc6 expression in bacteria. S. cerevisiae CDC6 was amplified from pET15b- 
CDC6 using the primers AM1 and AM2. The PCR product was cloned between 
BamHI and Xhol restriction sites in pGEX-6p-1 (GE Healthcare), generating 
pAM3. 

Purification of Cdt1-Mcm2-7 from yJF38, yJF35, yJF59 and yJF73. 21 of cells 
were grown in YP-raffinose at 30 °C to a cell density of 4 X 10’ cells per ml and 
arrested for 3h with 100 ng ml! of alpha-factor. Protein expression was induced 
by adding galactose (2%) and incubating at 30 °C for 3-4 h. Cells were collected, 
washed with ice-cold 25 mM HEPES-KOH pH 7.6, 1M sorbitol, then washed 
with buffer A (45 mM HEPES-KOH pH 7.6, 0.02% NP-40, 10% glycerol, 5 mM 
magnesium acetate (Mg(OAc),) and 0.1 M K-glutamate). The pellet was resus- 
pended in 0.5 volumes of buffer A/1 mM DTT/protease inhibitors (Roche) and 
frozen drop-wise in liquid nitrogen. Frozen drops of cells were crushed using a 
freezer mill (SPEX CertiPrep 6850 Freezer/Mill) with 6 cycles of 2 min crushing at 
rate 15. Frozen cell powder was thawed at room temperature, resuspended in 1 
volume of buffer A/I1mM DTT/protease inhibitors, and the concentration of 
K-glutamate was adjusted to 0.5 M. The suspension was centrifuged for 1h at 
50,000 r.p.m. using a Ti70.1 rotor (257,000g). The clear phase was recovered, 
dialysed twice for 1h against buffer B (25mM HEPES-KOH pH7.6, 0.02% 
NP-40 and 10% glycerol)/0.1 M K glutamate/5 mM Mg(OAc),, and then centri- 
fuged for 30 min at 50,000 r.p.m. Anti-Flag immunoprecipitation was performed 
by adding 1 ml packed bead volume of washed anti-Flag M2 agarose (Sigma) to 
the supernatant and incubating for 1 h at 4 °C. Beads were recovered, and washed 
with ten bed resin volumes (BVs) of buffer B/0.1M K glutamate/1 mM ATP/ 
5 mM Mg(OAc)p. Elution was performed by adding 0.5 mg ml 3XFlag peptide. 
After 20min at 4°C, the flow-through was collected, concentrated using a 
Microcon YM-10, 10000 MWCO (Millipore) and fractionated on a 24ml 
Superdex 200 10/300 GL column (GE Healthcare) pre-equilibrated in buffer 
B/0.1M K acetate. Peak fractions were pooled and aliquoted. 

Purification of Mcm2-7 from yJF39. yJF39 was purified as described for yJF38. 
The only difference was a pre-incubation step with IgG Sepharose 6 Fast Flow 
beads (GE Healthcare), to remove endogenous Cdtl, before the sample was 
loaded onto the gel filtration column. 100 ul of IgG packed resin was added to 
the 3 X Flag peptide-eluted sample and incubated for 30 min at 4 °C. The unbound 
fraction was recovered, concentrated and loaded onto a Superdex 200 10/300 GL 
column (GE Healthcare), as described for yJF39. 

Purification of ORC from ySD-ORC. 2] of cells were grown in YP-raffinose at 
30 °C toa cell density of 4 X 10’ cells per ml and arrested for 3 h with 100 ng ml ' 
of alpha-factor. After arrest, protein expression was induced by adding galactose 
(2%) for 3-4h at 30°C. Cells were collected, washed with ice-cold 25 mM 
HEPES-KOH pH 7.6/1 M sorbitol, then washed with buffer C (25 mM HEPES- 
KOH pH 7.6, 0.05% NP-40 and 10% glycerol)/0.1 M KCL. The pellet was resus- 
pended in 0.5 volumes of buffer C/0.1M KCl/1mM DTT/Protease inhibitors 
(Roche) and frozen drop-wise in liquid nitrogen. Frozen drops of cells were 
crushed using a freezer mill (SPEX CertiPrep 6850 Freezer/Mill) with 6 cycles 
of 2 min crushing at rate 15. Frozen cell powder was thawed at room temperature, 
resuspended in 1 volume of buffer C/0.1 M KCl/1 mM DTT/protease inhibitors, 
and the concentration of KCl was adjusted to 0.5 M. The suspension was centri- 
fuged for one hour at 50,000r.p.m. using a Ti70.1 rotor. The clear phase was 
recovered, and subjected to calmodulin affinity purification by adding 2mM 
CaCl, and 1.5ml of packed beads of Calmodulin affinity resin (Stratagene). 
After 3h at 4°C, beads were collected, washed with 10 BVs of buffer C/0.3 M 
KCl/2 mM CaCl,/2 mM f-mercaptoethanol. Elution was performed with buffer 
C/0.3M KCl/2mM EGTA/1mM EDTA/2 mM -mercaptoethanol, peak frac- 
tions were pooled and subjected to fractionation over a Superdex 200 10/300 
GL column (GE Healthcare), pre-equilibrated in buffer C/0.15 mM KCl/2 mM 
B-mercaptoethanol. Fractions containing ORC were pooled and concentrated 
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over a 1 ml MonoQ 5/50 GL column (GE Healthcare) using an elution gradient of 
0.15-0.5 M KCl over 10 column volumes. Peak fractions containing ORC were 
pooled, dialysed against buffer C/0.3 M K-acetate and stored in aliquots. 
Purification of ORC(A6) from ySD-ORC(A6). To purify ORC without Orcé6, a 
similar protocol to the purification of ORC was followed, except that Flag immu- 
noprecipitation of endogenous Orc6 was included, to remove complexes con- 
taining Orc6 (Supplementary Fig. 9c). This was performed as follows: pooled 
fractions from the gel filtration were incubated for 30 min at 4 °C with 1 ml packed 
bead volume of anti-Flag M2 agarose (Sigma). Flow-through was collected and 
concentrated over a MonoQ 5/50 GL column (GE Healthcare), as described for the 
purification of ORC. 

Proteins expressed in bacteria. All expression plasmids were transformed into 
BL21 DE3 Codon+ RIL cells (Stratagene) (unless indicated). 31 of cells (unless 
indicated) were grown at 37°C to a density of Dgoonm = 0.5-0.8. Cells were 
chilled on ice, and then IPTG was added to 1 mM. Induction was carried out 
overnight at 18°C. 

Purification of MBP-Mcm3. 0.51 of cells expressing MBP-Mcm3 were grown as 
described above. Cells were collected, washed once with ice-cold 25 mM HEPES- 
KOH pH7.6/1M sorbitol, once with buffer D (50 mM Tris-HCl pH 7.5, 0.05% 
NP-40, 10% glycerol)/1 M NaCl and then the pellet was resuspended in 20 ml of 
buffer D/1 M NaCl/2 mM B-mercaptoethanol/protease inhibitors (Roche). 50 pl 
of lysozyme (50 mg ml!) was added and the suspension incubated for 20 min 
at 4°C. Cells were kept on ice and sonicated 3 X 30s at 15 microns using a 
sonicator Soniprep 150 (Sanyo). Lysate was centrifuged for 1h at 45,000 r.p.m. 
using a Ti45 rotor (235,400g). The soluble phase was collected and incubated with 
2ml packed amylose bead volume (NEB) at 4°C for 1h. Beads were washed 
with ten BVs of buffer D/0.3M NaCl/2mM £-mercaptoethanol. Elution was 
performed with buffer D/0.3 M NaCl/2 mM {-mercaptoethanol/10 mM maltose. 
Peak fractions were pooled and 0.5 ml were subjected to fractionation over a 
Superdex 200 10/300 GL column (GE Healthcare) pre-equilibrated in buffer 
D/0.3 M NaCl/2mM f-mercaptoethanol. The peak fraction, of correct mole- 
cular weight, was collected and subjected to PreScission protease cleavage with 
100 units of PreScission protease (GE healthcare). After 30 min at room temper- 
ature, 1 volume of buffer D/2 mM B-mercaptoethanol was added, and the sample 
was fractionated over a 0.15 ml MonoQ PC1.6/5 column (GE Healthcare), using 
an elution gradient of 0.15-0.7 M NaCl over 50 column volumes. Peak fractions 
containing untagged Mcm3 proteins were kept. The same protocol was used to 
purify Mcm3 fragments and mutants. 

Purification of Mcm2, Mcm7 and Mcm3. To purify these proteins, we followed 
the protocol described in ref. 31. The starting cultures were 31 in volume. 
Purification of Mcm5. Cells were grown and expressed as described above. Cells 
were collected, washed once with ice-cold 25 mM HEPES-KOH pH 7.6/1 M sor- 
bitol, once with buffer E (50 mM Tris-HCl pH 7.5, 0.05% NP-40, 10% glycerol, 
1mM EDTA and 1 mM DTT)/1 M NaCl, and the pellet was resuspended in 40 ml 
of buffer E/1 M NaCl/protease inhibitors (Roche). Cells were sonicated and the 
lysate centrifuged as described for the purification of MBP-Mcm3. The soluble 
fraction from centrifugation was treated with 0.3 gml_’ ammonium sulphate, 
and stirred for 20min at 4°C. The mixture was centrifuged at 17,000 r.p.m. 
(34,600g) for 20min. The pellet was resuspended in 40 ml of buffer E con- 
taining 0.25 gml' of ammonium sulphate and centrifuged as before. This was 
repeated with 0.20 mg ml’ ammonium sulphate. The pellet was resuspended in 
30 ml of buffer E and dialysed against buffer E for one hour at 4 °C. After checking 
that the conductivity of the sample was below that of buffer E/0.1.M NaCl, the 
sample was applied to a 5 ml FF Q column (GE Healthcare), pre-equilibrated in 
buffer E/0.1 M NaCl. Elution was done using a gradient of 0.1 M to 0.5M NaCl 
over 10 column volumes. Peak fractions were pooled and diluted down to a 
conductivity below that of buffer E/0.1 M NaCl. The sample was then applied to 
a 5 ml Heparin column (GE healthcare) pre-equilibrated in buffer E/0.1 M NaCl. 
Flow through was collected and subjected to fractionation over an 8 ml MonoQ 
column (GE Healthcare). Elution was performed using a gradient of 0.1M to 
0.5 M NaCl over 20 column volumes. Peak fractions were collected and subjected 
to fractionation over a Superdex 200 10/300 GL column (GE Healthcare) pre- 
equilibrated with buffer E/0.3 M NaCl. Peak fractions were pooled, concentrated 
using a Microcon YM-10, 10000 MWCO (Millipore) and aliquoted. 
Purification of Mcm4. 31 of expressing cells were grown and we followed the 
purification process described in ref. 31. At the end of the described protocol, we 
introduced an extra gel filtration step using a Superdex 200 10/300 GL column 
(GE Healthcare), pre-equilibrated in buffer E/0.3 M NaCl. 

Purification of Mcm6. To purify Mcm6, the same protocol as that described for 
Mcm5 was followed, the difference being that Mcm6 was bound to a 5 ml heparin 
column (GE Healthcare). Bound protein was eluted using a gradient of 0.1 to 
0.5M NaCl over 20 column volumes. Peak fractions were pooled and the 
conductivity was reduced to below that of buffer E/0.1M NaCl. The sample 


ARTICLE 


was then applied to an 8 ml MonoQ column (GE Healthcare), as described in 
the purification of Mcm5. 

Purification of Cdtl. Expressing cells were grown and induced as described 
before. Cells were collected, washed with PBS pH 7.5, and then washed with buffer 
F (PBS pH7.5, 10% glycerol, 1 mM DTT and 0.3 M NaCl). The pellet was resus- 
pended in 40 ml of buffer F/protease inhibitors (Roche) and sonicated 3 X 30s 
using a sonicator Soniprep 150 (Sanyo) at 15 microns. Lysate was centrifuged at 
45,000 r.p.m. (235,400g) for 60 min using a Ti45 rotor. 2 ml of packed bead 
volume glutathione Sepharose 4Fast Flow (GE Healthcare) were added to the 
soluble phase and incubated for 2.5h at 4°C. The solution was centrifuged at 
3,000 r.p.m. (1,400g) for 3 min, beads were collected, 10 column volumes of buffer 
F were added, and the mixture was incubated for 10 min at 4°C. This wash step 
was repeated with 20 column volumes of buffer F, and 10 column volumes of 
buffer E/0.3 M NaCl). The pellet was then resuspended in 2 ml of buffer E/0.3 M 
NaCl. 125 units of PreScission Protease (GE Healthcare) were added and the 
mixture was incubated overnight at 4°C. The flow through was collected and 
concentrated using a Microcon YM-10, 10000 MWCO (Millipore), then the 
sample was loaded onto a gel filtration column (Superdex 200 10/300 GL column 
(GE Healthcare)) pre-equilibrated in buffer B/1 mM EDTA/1mM DTT/0.3M 
NaCl. Peak fractions were pooled, concentrated and aliquoted. 

Formation of MCM complexes from individually purified subunits. MCM 
complexes were formed by combining 10g of individually purified subunits 
and fractionated over a Superdex 200 PC 3.2/30 column (GE Healthcare), pre- 
equilibrated in buffer B/0.1 M K acetate. High molecular weight fractions con- 
taining Cdtl-Mcm2-7 complex were pooled, concentrated and aliquoted. 
Supplementary Fig. 12 shows that this complex can be loaded as efficiently as 
the complex purified from yeast. 

Purification of GST-Cdc6. This protocol is modified from that described in ref. 
10. 11 of expressing cells were grown at 37 °C to Deoo nm = 0.6, then induced with 
0.5 mM IPTG for 5h at 18 °C. Cells were collected at 6,000 r.p.m. (6,100g) in an 
SLA-3000 rotor (Sorvall) for 10 min. Pellets were resuspended in 50 ml buffer G 
(50 mM K,HPO,/KH>2PO,, pH7.5, 5mM MgCl, 1% Triton X-100 and 1mM 
DTT)/2mM ATP/0.15M KOAc/protease inhibitors (Roche) and 100 1g ml! 
lysozyme added. The mixture was incubated at 4°C for 30 min and sonicated 
for 2min (5s off, 5s on) at 15 microns. The suspension was centrifuged at 
15,000 r.p.m. (27,000g) for 15 min in a $S34 rotor (Sorvall) and the supernatant 
transferred to 2 ml bed resin glutathione Sepharose (GE Healthcare). This was 
rotated at 4 °C for 3 h. Glutathione beads and bound proteins were recovered and 
washed with 20 column volumes of buffer G/0.15 M KOAc/2 mM ATP. A 50% 
slurry with buffer G/0.15M KOAc/2mM ATP was made and 100 units 
preScission protease (GE Healthcare) added. The mixture was incubated for 
2h at 4°C. The flow-through was recovered and the concentration of KOAc 
diluted to 75 mM with buffer G/2 mM ATP. This was incubated with 2 ml bed 
resin hydroxyapatite prewashed in buffer G/0.075M KOAc/2mM ATP. The 
protein-hydroxyapatite was washed with 5 BVs of buffer G/2 mM ATP and then 
washed with 5 BVs of buffer G/0.15 M KOAc/15% glycerol. Cdc6 was eluted with 
buffer G/0.4 M KOAc/15% glycerol. Peak fractions were pooled and concentrated 
using a Centricon Plus-20 Centrifugal Filter (Millipore), then aliquoted. 
Purification of ORC-P. To purify CDK-phosphorylated ORC (ORC-P), ORC 
was isolated from 501 of alpha-factor-arrested G1 yeast (YDR11) cells via calmo- 
dulin pull-down as described*, incubated with TEV protease overnight at 4 °C to 
remove the tag, dialysed against 25 mM HEPES-KOH pH 7.6/0.1 M KCl/5mM 
Mg(OAc)>/0.02% NP40/10% glycerol/1mM DTT, and concentrated to a final 
volume of 2 ml. 

Ina parallel preparation, 501 of YDR12 (MATa ade2-1 ura3-1 his3-11,15 trp1-1 
leu2-3,112 can1-100 cdc15-2 barl::kanMX pep4::HIS3 ura3::Pgall, 10-CDC6- 
TAPtcp (URA3)) were grown at 25°C in YP-raffinose to 2 X 10’ cells per ml. 
Cdc6-TAP'©? expression was induced by addition of 2% galactose for 8h at 
25 °C, upon which the cells arrest with a long-budded phenotype and with repli- 
cated DNA in G2/M phase. The cells were collected by centrifugation, washed 
twice with cold buffer containing 25 mM HEPES-KOH pH 7.6/1 M sorbitol, once 
with cold buffer containing 45 mM HEPES-KOH pH 7.6/0.1 M KCI/0.02% NP- 
40/10% glycerol, resuspended in 0.5 volume of packed cell volume of buffer 
containing 45 mM HEPES-KOH pH 7.6/0.1 M KC1/0.02% NP-40/10% glycerol/ 
2mM DTT/2X complete protease inhibitor cocktail (Roche), and the resulting 
cell suspension frozen drop-wise directly in liquid nitrogen. The frozen cell 
suspension was crushed using a SPEX 6870 Freezer/Mill, the resulting frozen 
powder thawed on ice, diluted with 1 volume of 45 mM HEPES-KOH pH 7.6/ 
0.1M KCI/0.02% NP-40/10% glycerol/iImM DTT, the salt concentration 
adjusted to 0.3M KCL, and the resulting lysate clarified by centrifugation in a 
Beckman 45 Ti rotor at 42,000 r.p.m. (205,000g) for 1 h. The clarified extract was 
supplemented with 2mM CaCl, and a complex containing Cdce6-TAP’©?- 
Clb2-Cdc28-Cks1 was isolated from this extract using calmodulin affinity 
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purification in buffer containing 45 mM HEPES-KOH pH 7.6/0.3 M KCI/0.02% 
NP-40/10% glycerol/1 mM DTT. This partially purified complex was incubated 
with TEV protease overnight at 4°C to remove the tag from Cdcé, dialysed 
against 25 mM HEPES-KOH pH 7.6/0.1 M KCI/5 mM Mg(OAc),/0.02% NP40/ 
10% glycerol/1 mM DTT, and concentrated to a final volume of 2 ml. 

The concentrated calmodulin-purified and TEV-protease-digested ORC and 
Cdc6-Clb2-Cdc28-Cks1 fractions were combined, supplemented with 3 mM 
ATP, and incubated for 45 min at 30°C. The phosphorylated ORC (ORC-P) 
resulting from this reaction was re-isolated from the reaction by gel-filtration 
on a 120 ml Superdex 200 column in a buffer containing 25 mM HEPES-KOH 
pH7.6/0.1 M KC1/0.02% NP40/10% glycerol/1 mM DTT, followed by fractiona- 
tion on a 1 ml MonoQ ion-exchange column as described’. 

Loading reaction. In this study, four main differences to the conditions described 
in ref. 7 were introduced in the loading reactions. K-glutamate was substituted 
with K-acetate in the binding and washing buffers. For silver staining purposes we 
increased the amount of protein used in the assays by four fold compared to ref. 7. 
In addition, ARS305 was amplified using an oligonucleotide primer with a photo- 
cleavable biotin (Integrated DNA technologies) at one end, as described in ref. 32. 
We have optimised the photocleavage to minimise DNA damage by irradiating 
for 10 min at 330 nm (See Supplementary Fig. 11). And finally, 2.5 pmol of DNA 
molecules have been used, instead of the 1 pmol used before. 

Loading assays with ORC and ORC-P. Mcm2-7 loading onto immobilized 
linear 1 kb ARS305-containing DNA using 50nM purified ORC or ORC-P was 
performed as described*. To test the effect of ORC-P dephosphorylation by 
lambda-phosphatase, 20 pmol of purified ORC-P at a concentration of 0.67 mM 
was supplemented with 20 mM MnCl, and incubated for 20 min at 30°C with 
either 400 units of l-phosphatase (NEB) or a buffer control before addition of 
the treated ORC-P to the loading reaction. 

ATPase assays. Based on the method described in ref. 26 with the following 
modifications. Reactions were carried out at 30°C for 20 min, buffer contained 
25 mM HEPES-KOH, pH 7.6/0.1% NP-40/5 mM Mg(OAc),/1 mM EDTA/1 mM 
EGTA/100mM_ K-acetate/5% glycerol/ImM DTT/100HM ATP (including 
2.5uCi of [a-**P]ATP). Each reaction contained 2.5 pmol of 1kb linear 


ARS305 DNA. Where indicated, 2.5 pmol of each protein were also included 
in these reactions. Reactions were stopped by spotting 1 il of each reaction on 
PEI-cellulose TLC plates (CamLab). The cellulose membrane was developed 
in 0.6 M Na,HPO,/NaH,PO, pH 3.5, and quantified on a Phosphorimager (GE 
Healthcare). 

Antibodies for western blot analysis. Anti- Mcm2 (yN-19, sc-6680, Santa Cruz), 
anti-Mcm7 (yN-19, sc-6688, Santa Cruz), anti-Mcm4 (yC-19, sc-6685, Santa 
Cruz), anti-Mcm5 (yC-19, sc-6687, Santa Cruz), anti-Orc6 (SB49), anti-Cdc6 
(98H/5), anti-Cdt1*’, anti-Flag M2-peroxidase (Sigma), anti-PAP (peroxidase- 
anti-peroxidase)-HRP (horseradish peroxidase) (Sigma), anti-MBP (maltose 
binding protein)-HRP (NEB). Antibodies against Mcm6 and Mcm3 were gifts 
from the Labib laboratory™*. 
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Dusty starburst galaxies in the early Universe as 
revealed by gravitational lensing 
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In the past decade, our understanding of galaxy evolution has been 
revolutionized by the discovery that luminous, dusty starburst 
galaxies were 1,000 times more abundant in the early Universe than 
at present'”. It has, however, been difficult to measure the complete 
redshift distribution of these objects, especially at the highest red- 
shifts (z > 4). Here we report a redshift survey at a wavelength of 
three millimetres, targeting carbon monoxide line emission from 
the star-forming molecular gas in the direction of extraordinarily 
bright millimetre-wave-selected sources. High-resolution imaging 
demonstrates that these sources are strongly gravitationally lensed 
by foreground galaxies. We detect spectral lines in 23 out of 26 
sources and multiple lines in 12 of those 23 sources, from which we 
obtain robust, unambiguous redshifts. At least 10 of the sources are 
found to lie at z > 4, indicating that the fraction of dusty starburst 
galaxies at high redshifts is greater than previously thought. 
Models of lens geometries in the sample indicate that the back- 
ground objects are ultra-luminous infrared galaxies, powered by 
extreme bursts of star formation. 

We constructed a catalogue of high-redshift (z > 1) galaxy candi- 
dates from the first 1,300 square degrees of the South Pole Telescope 
(SPT) survey by selecting sources with dust-like spectral indexes in the 
1.4 and 2.0 mm SPT bands*. A remarkable aspect of selecting sources 
based on their flux at millimetre wavelengths is the so-called negative 
k-correction®, whereby cosmological dimming is compensated by 
the steeply rising dust spectrum as the source redshift increases. As a 
result, a millimetre-wave-selected sample should draw from the red- 
shift distribution of dusty starburst galaxies with little bias over the 
entire redshift range in which they are expected to exist. To isolate 
the high-redshift, dusty-spectrum galaxy population, sources with 
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counterparts in the IRAS Faint Source Catalog® (typically z< 0.03) 
were removed, and those with counterparts in the 843 MHz Sydney 
University Molonglo Sky Survey’ were removed to exclude sources 
with strong synchrotron emission (for example, flat-spectrum radio 
quasars) that may have passed the spectral index cut. A sample of 47 
sources with 1.4-mm flux density >20 mJy and accurate positions 
were selected for high-resolution imaging with the Atacama Large 
Millimeter/submillimeter Array (ALMA). Our ALMA spectroscopic 
observations targeted a sample of 26 sources, all but two of which are in 
the imaging sample (see Supplementary Information). These objects 
are among the brightest dusty-spectrum sources in the z > 0.1 extra- 
galactic sky at millimetre wavelengths. 

Gravitationally lensed sources are expected to predominate in 
samples of the very brightest dusty galaxies because of the rarity of 
unlensed dusty starburst galaxies at these flux levels* '°. Massive ellip- 
tical galaxies, acting as lenses, will have Einstein radii as large as 2” and 
may magnify background galaxies by factors of 10 or more. To confirm 
the lensing hypothesis and determine magnifications, we imaged 
47 SPT sources with ALMA at 870 um in two array configurations, 
which provide angular resolutions of 1.5” and 0.5” (full-width at half- 
maximum). A sample of these objects with infrared imaging, spectro- 
scopic redshifts and resolved structure is shown in Fig. 1. Integration 
times of only one minute per source are adequate to show that most 
sources are resolved into arcs or Einstein rings—hallmarks of gravi- 
tational lensing. For all sources for which we have infrared and sub- 
millimetre imaging, as well as spectroscopic redshifts, the emission 
detected by ALMA coincides with massive foreground galaxies or 
galaxy groups/clusters, but is spatially distinct and at drastically different 
redshifts (see Fig. 2 and Supplementary Fig. 3). Using a modelling 


1California Institute of Technology, 1200 East California Boulevard, Pasadena, California 91125, USA. *Steward Observatory, University of Arizona, 933 North Cherry Avenue, Tucson, Arizona 85721, USA. 
3Department of Physics and Atmospheric Science, Dalhousie University, Halifax, Nova Scotia B3H 3/5, Canada. “Institute of Astronomy, University of Cambridge, Madingley Road, Cambridge CB3 OHA, UK. 
5European Southern Observatory, Karl-Schwarzschild Strasse, D-85748 Garching bei Miinchen, Germany. °Department of Physics, McGill University, 3600 Rue University, Montreal, Quebec H3A 278, 
Canada. ’Max-Planck-Institut fiir Radioastronomie, Auf dem Hiigel 69, D-53121 Bonn, Germany. ®University of Pennsylvania, 209 South 33rd Street, Philadelphia, Pennsylvania 19104, USA. °University of 
Chicago, 5640 South Ellis Avenue, Chicago, Illinois 60637, USA. 10Harvard-Smithsonian Center for Astrophysics, 60 Garden Street, Cambridge, Massachusetts 02138, USA. Department of Physics, 


Harvard University, 17 Oxford Street, Cambridge, Massachusetts 02138, USA. !?Kavli Institute for Cosmological Physics, University of Chicago, 5640 Sout! 


Ellis Avenue, Chicago, Illinois 60637, USA. 


13Enrico Fermi Institute, University of Chicago, 5640 South Ellis Avenue, Chicago, Illinois 60637, USA. “Department of Physics, University of Chicago, 5640 South Ellis Avenue, Chicago, Illinois 60637, USA. 
15 Jet Propulsion Laboratory, 4800 Oak Grove Drive, Pasadena, California 91109, USA. }°Department of Physics and Astronomy, University of Missouri, 5110 Rockhill Road, Kansas City, Missouri 64110, 
USA. !’Department of Astronomy and Astrophysics, University of Chicago, 5640 South Ellis Avenue, Chicago, Illinois 60637, USA. '8Argonne National Laboratory, 9700 South Cass Avenue, Argonne, Illinois 
60439, USA. !?National Radio Astronomy Observatory, 520 Edgemont Road, Charlottesville, Virginia 22903, USA. @°Department of Physics, University of California, One Shields Avenue, Davis, California 
95616, USA. *!Department of Physics, University of California, Berkeley, California 94720, USA. 22Department of Astronomy, University of Florida, Gainesville, Florida 32611, USA. *°Department of Physics 


and Astronomy, University College London, Gower Street, London WC1E 6BT, UK. **Department of Astrophysical and Planetary Sciences and Department of Physics, University of Colorado, Boulder, 


Colorado 80309, USA. *>Physics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA. °°Department of Physics and Astronomy, University of California, Los Angeles, California 
90095-1547, USA. ?’Australia Telescope National Facility, CSIRO, Epping, New South Wales 1710, Australia. ®Department of Physics, University of Michigan, 450 Church Street, Ann Arbor, Michigan 
48109, USA. ?°Observatories of the Carnegie Institution for Science, 813 Santa Barbara Street, Pasadena, California 91101, USA. 3°Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, 


Maryland 21218, USA. ?!Physics Department, Center for Education and Research in Cosmology and Astrophysics, Case Western Reserve University, Cleveland, Ohio 44106, USA. °*Department o 


Astronomy, University of Michigan, 500 Church Street, Ann Arbor, Michigan 48109, USA. °°Liberal Arts Department, School of the Art Institute of Chicago, 112 South Michigan Avenue, Chicago, Illinois 
60603, USA. 3*Department of Physics, Yale University, PO Box 208210, New Haven, Connecticut 06520-8120, USA. **Institut d'Astrophysique Spatiale, Batiment 121, Université Paris-Sud XI et CNRS, 


91405 Orsay Cedex, France. 


344 | NATURE | VOL 495 | 21 MARCH 2013 


©2013 Macmillan Publishers Limited. All rights reserved 


= 
SPT 0103-45 SPT 0113-46 SPT 0125-47 SPT 0346-52 SPT 0418-47 
Xe) oe 
& ’ , . q > 
og + 

z= 3.090 7=4.232 z=2.514 z= 5.656 z= 4.224 
HST/WFC3 HST/WFC3 VLT/ISAAC HST/WEC3 VLT/SAAC 

SPT 0529-54 SPT 0538-50 SPT 2134-50 SPT 2146-55 _ SPT 2147-50 


z= 3.369 
SOAR/OSIRIS 


z= 2.782 
HST/WFC3 


z=2.780 
HST/WFC3 


z= 4.567 
VLT/ISAAC 


“* 


z=3.761 
VLT/SAAC 


LETTER 


Figure 1 | Near-infrared and ALMA submillimetre-wavelength images of 
SPT targets. Images are 8” X 8”. We show 10 sources for which we have 
confirmed ALMA spectroscopic redshifts, deep near-infrared (NIR) imaging, 
and well-resolved structure in the ALMA 870 Lm imaging; source names are in 
blue in each panel. The greyscale images are NIR exposures from the Hubble 
Space Telescope Wide Field Camera 3 (HST/WFC3, co-added F160W and 
F110W filters), the Very Large Telescope Infrared Spectrometer and Array 
Camera (VLT/ISAAC: K; band) or the Southern Astrophysical Research 
Telescope Ohio State Infrared Imager/Spectrometer (SOAR/OSIRIS: K, band), 
and trace the starlight from the foreground lensing galaxy. The NIR images are 
shown with logarithmic stretch, and each panel shows at bottom left in black 
the telescope/instrument used to obtain the image. The red contours are ALMA 


procedure that treats the interferometer data in their native measure- 
ment space, rather than through reconstructed sky images, to simulta- 
neously determine the source/lens configuration and correct for 
antenna-based phase errors’’, we are able to determine magnifications 
and derive intrinsic luminosities for our sources. Complete models of 
four lenses'!, as well as preliminary models of eight more, indicate 
lensing magnifications between 4 and 22. After correcting for the 
magnification, these sources are extremely luminous—more than 
10’* times solar luminosity (Ls) and sometimes >10'°L 5—implying 
star-formation rates in excess of 500 Mo yr’. 

Obtaining spectroscopic redshifts for high-redshift, dusty starburst 
galaxies has been notoriously difficult. To date, most spectroscopic 
redshift measurements have come from the rest-frame ultraviolet and 
optical wavebands after multi-wavelength counterpart identification*’*”. 
These observations are difficult, owing to the extinction of the ultra- 
violet light by the dust itself, the cosmological dimming, and the ambi- 
guity in the association of the dust emission with multiple sources of 
optical emission visible in deep observations. A much more direct 
method to determine redshifts of starburst galaxies, particularly at high 
redshift, is through observations of molecular emission associated with 
their dusty star-forming regions. The millimetre and submillimetre 
transitions of molecular carbon monoxide (CO) and neutral carbon 
(C1) are well-suited for this purpose’. These emission lines are a major 
source of cooling for the warm molecular gas fuelling the star forma- 
tion, and can thus be related unambiguously to the submillimetre 
continuum source’*. Until recently, bandwidth and sensitivity limi- 
tations made this approach time-intensive. The combination of 
ALMA—even with its restricted early science capabilities and only 
16 antennas—and a unique sample of extraordinarily bright millimetre 
sources has changed this situation greatly, allowing us to undertake a 
sensitive, systematic search for molecular and atomic lines across 
broad swaths of redshift space at z > 1. 

We conducted a redshift search in the 3 mm atmospheric transmis- 
sion window with ALMA using five spectral tunings to cover 84.2- 
114.9 GHz. For z> 1, at least one CO line will fall in this frequency 
range, except for a small redshift ‘desert’ (1.74 < z< 2.00). For sources 
at z > 3, multiple transitions (such as rotational transitions of CO and 
C1?P,—>7P,)) are redshifted into the observing band, allowing for an 
unambiguous redshift determination. We find one or more spectral 


870 tum imaging showing the background source structure, clearly indicative of 
strong lensing from galaxy-scale haloes. In all cases, the contours start at 5¢ and 
are equally spaced up to 90% of the peak significance, which ranges from 12 to 
35. Spectroscopic redshifts of the background sources are shown in red in each 
panel, above the NIR telescope/instrument names. The ALMA exposures were 
approximately 2-min integrations, roughly equally divided between the 
compact and extended array configurations. The resulting resolution is 0.5”. 
SPT 0103-45 shows a rare lensing configuration of one lens and two 
background sources at different redshifts, one visible with ALMA and one with 
HST. SPT 0346-52, with a CO-derived redshift of z = 5.656, is among the 
highest-redshift starbursts known. (See Supplementary Information for more 
details.) 


features in 23 of 26 SPT-selected sources. The detections comprise 44 
emission line features, which we identify as redshifted emission from 
molecular transitions of '*CO, CO, H,O and H,0*, and a Cr fine 
structure line. The spectra of all sources are shown in Fig. 2. For 18 of 
the sources we are able to infer unique redshift solutions, either from 
ALMA data alone (12), or with the addition of data from the Very 
Large Telescope and/or the Atacama Pathfinder Experiment telescope 
(6). With the 10 z>4 objects discovered here, we have more than 
doubled the number of spectroscopically confirmed, ultra-luminous 
galaxies discovered at z > 4 in millimetre/submillimetre surveys in the 
literature (of which just nine have been reported previously'*!*'**"). 
Two sources are at z = 5.7, placing them among the most distant ultra- 
luminous starburst galaxies known. 

The SPT dusty galaxy redshift sample comprises 28 sources, as we 
include an additional two SPT sources with spectroscopic redshifts” 
that would have been included in the ALMA program had their red- 
shifts not already been determined. Of the 26 ALMA targets, three lack 
a spectral line feature in the ALMA band. We tentatively and conser- 
vatively place these at z= 1.85, in the middle of the z= 1.74-2.00 
redshift desert, though it is also possible that they are located at very 
high redshift or have anomalously faint CO lines. For the five sources 
for which only a single emission line is found, only two or three red- 
shifts are possible (corresponding to two choices of CO transition) 
after excluding redshift choices for which the implied dust temper- 
ature—derived from our extensive millimetre/submillimetre pho- 
tometric coverage (provided by ALMA 3mm, SPT 2 and 1mm, 
APEX/LABOCA 870 um and Herschel/SPIRE 500, 350, 250 tum obser- 
vations”)—is inconsistent with the range seen in other luminous 
galaxies”. For these sources, we adopt the redshift corresponding to 
the dust temperature closest to the median dust temperature in the 
unambiguous spectroscopic sample, as shown in Fig. 2. 

The cumulative distribution function ofall redshifts in this sample is 
shown in Fig. 3. The median redshift of our full sample is Zea = 3.6. 
The redshift distribution of SPT sources with millimetre spectroscopic 
redshifts is in sharp contrast to that of radio-identified starbursts 
with optical spectroscopic redshifts, which have a significantly lower 
median redshift of Z,¢q = 2.2, and for which only 15-20% of the 
population is expected to be at z>3 (ref. 2). Part of this difference 
can be attributed to the high flux threshold of the original SPT 
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Figure 2 | ALMA 3 mm spectra of 26 SPT sources. The vertical axis is 
observed flux density in units of mJy, with 30 mJy offsets between sources for 
clarity. Spectra are continuum-subtracted. The strong CO lines are indicative of 
dust-enshrouded active star formation. The spectra are labelled by source and 
redshift. Black labels indicate unambiguous redshifts (18), with the subset in 
bold font (12) having been derived from the ALMA data alone. Sources labelled 
in blue (5) are plotted at the most likely redshift of multiple options, based on 
the dust temperature derived from extensive far-infrared photometry. Three 
sources with no lines detected are placed at z = 1.85, in the middle of the 


selection, which effectively requires that the sources be gravitationally 
lensed. A much smaller total volume is lensed at z < 1 than at higher 
redshift, and, as expected, we do not find any such sources in the SPT 
sample”. However, if we only compare sources at z> 2 (the lowest 
confirmed spectroscopic redshift in the SPT sample), the median 
redshift of the radio-identified sample is still significantly lower (2.6) 
than the SPT sample, and the probability that both samples are drawn 
from the same distribution is <10 ° by the Kolmogorov-Smirnov 
test. A recently published survey” of millimetre-identified starbursts 
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redshift range for which we expect no strong lines, and labelled in red. Total 
integration times for each source were roughly ten minutes. The synthesized 
beam size ranges from 7” X 5” to 5” X 3” over the frequency range of the search, 
which is inadequate to spatially resolve the velocity structure of the lensed 
sources. Transitions of species detected in at least one source are indicated by 
vertical lines. Rotational transitions of the '*CO (solid) and '*CO (dashed) 
isotopologues are shown in grey. Water lines are marked by blue dashed lines, 
ionized water lines by red dashed lines, and atomic carbon ([C1]) by the green 
dashed line. 


with optical counterparts determined from high-resolution millimetre 
imaging and redshifts measured from optical spectroscopy or esti- 
mated from optical photometry found a median redshift of Zeq = 2.8. 
Again comparing the distribution of sources at z> 2, the probabi- 
lity that these objects and the SPT-selected sources are drawn from 
the same parent distribution is 0.43, indicating rough consistency 
between our secure redshift determinations and the distribution esti- 
mated from the optical methods. A full analysis of the molecular line 
detections, redshift determinations, residual selection effects, and a 


Figure 3 | The cumulative redshift distribution of luminous dusty starburst 
galaxies, as measured with different techniques. The SPT millimetre-selected 
sample (SPT+ALMA), with redshifts directly determined from spectroscopic 
observations of the molecular gas in the galaxies, is shown in black. The existing 
samples of radio-identified (Radio-ID) starbursts”'*'”"*”’, with redshifts 
determined from rest-frame ultraviolet spectroscopy, are compiled in the blue 
distribution. The redshift distribution”* of millimetre-identified (mm-ID) 
starburst galaxies in the COSMOS survey is shown in red/orange, though the 
majority of redshifts in this sample are derived from optical/infrared 
photometry of the sources rather than spectroscopy, and therefore less certain. 
Sources at z< 1 were removed from the previous samples of starburst galaxies 
to better compare to the selection effect imposed on the SPT sample due to 
gravitational lensing. The distribution of redshifts for radio-identified sources is 
incompatible with the distribution for the sample presented in this work. This 
measurement demonstrates that the fraction of dusty starburst galaxies at high- 
redshift is greater than previously derived, and that radio-identified samples 
were biased to lower redshift than the underlying population. 
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derivation of the intrinsic redshift distribution for the SPT sample is 
reported elsewhere”. 

These 26 sources represent less than 25% of the recently completed 
SPT survey and catalogue. This newly discovered population of high- 
redshift starbursts will enrich our understanding of obscured star 
formation in the early Universe. Existing semi-analytic hierarchical 
models of galaxy evolution’*”® have already had difficulties reconciling 
the number of z>4 systems inferred from previous observational 
studies'’***. The presence of two intensely starbursting galaxies at 
z= 5.7, 1 Gyr after the Big Bang, in a sample of just 26 sources, demon- 
strates that significant reservoirs of dust and molecular gas had been 
assembled by the end of the epoch of cosmic reionization. As the 
millimetre-brightest high-redshift starbursts in the sky, the present 
sample will be key targets for ALMA studies of star-formation physics 
at high redshift. The gravitational lensing of these sources provides 
access to diagnostic information from molecular lines that would 
otherwise take hundreds of times longer to observe, and effective 
source-plane resolution several times higher than can otherwise be 
achieved. 
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A multi-directional backlight for a wide-angle, 
glasses-free three-dimensional display 


David Fattal', Zhen Peng’, Tho Tran!, Sonny Vo', Marco Fiorentino’, Jim Brug! & Raymond G. Beausoleil' 


Multiview three-dimensional (3D) displays can project the 
correct perspectives of a 3D image in many spatial directions 
simultaneously’ *. They provide a 3D stereoscopic experience to 
many viewers at the same time with full motion parallax and do not 
require special glasses or eye tracking. None of the leading multi- 
view 3D solutions is particularly well suited to mobile devices 
(watches, mobile phones or tablets), which require the combina- 
tion of a thin, portable form factor, a high spatial resolution and a 
wide full-parallax view zone (for short viewing distance from 
potentially steep angles). Here we introduce a multi-directional 
diffractive backlight technology that permits the rendering of 
high-resolution, full-parallax 3D images in a very wide view zone 
(up to 180 degrees in principle) at an observation distance of up to 
a metre. The key to our design is a guided-wave illumination tech- 
nique based on light-emitting diodes that produces wide-angle 
multiview images in colour from a thin planar transparent light- 
guide. Pixels associated with different views or colours are spatially 
multiplexed and can be independently addressed and modulated at 
video rate using an external shutter plane. To illustrate the cap- 
abilities of this technology, we use simple ink masks or a high- 
resolution commercial liquid-crystal display unit to demonstrate 
passive and active (30 frames per second) modulation of a 64-view 
backlight, producing 3D images with a spatial resolution of 88 
pixels per inch and full-motion parallax in an unprecedented view 
zone of 90 degrees. We also present several transparent hand-held 
prototypes showing animated sequences of up to six different 200- 
view images at a resolution of 127 pixels per inch. 

Ideally, a perfect 3D display would reproduce the set of all light rays 
(or lightfield’) from a 3D scene. Although standard holography can 
perform this task very well, the recording of a holographic medium is 
too slow to permit real-time operation®. Autostereoscopic multiview 
3D displays are ‘the next-best thing’: they discretize the lightfield into 
narrowly spaced views to create the illusion of continuous parallax and 
a 3D stereoscopic effect up to a certain viewing distance’. They can 
be realized using pure geometrical optics techniques such as multi- 
projector’, parallax barrier’, integral imaging* or a combination of 
these*?. Multi-projector solutions have demonstrated impressively 
large 3D images with many views and high resolution, but they are 
difficult to implement on a mobile device. Recent lenticular-based 
solutions (see http://www.alioscopy.com and http://www.dimencodis- 
plays.com) have seemed promising for TV applications owing to their 
compatibility with standard liquid-crystal displays (LCDs); however, 
they provide only a limited amount of head movement (with the sets of 
views repeating several times across the view zone), and low resolution. 

Diffractive optics offers another way of creating a multiview lightfield. 
Digital holographic displays use a spatial light modulator to update small 
holographic cells (sometimes called hogels'®), each generating a few 
light rays. Despite recent progress (http://www.zebraimaging.com), the 

required density of active pixels limits the field of view (FOV) and 
precludes the operation of the digital holographic displays at video rate. 
Using a static fringe pattern with an external modulator"! can solve the 
video rate problem for a more modest number of views. Still, because 


those diffractive approaches rely on the first-order diffraction of a highly 
collimated free-space light source, they are hard to integrate, have limi- 
ted power efficiency, and are plagued by the presence of specular light 
and unwanted diffraction orders (even for complex diffractive elements 
such as blazed or sub-wavelength gratings”). 

Here we introduce a diffractive backlight solution that seems well 
suited to realizing a multiview 3D display for mobile devices. It features 
(1) full motion parallax in a wide view zone (90° FOV demonstrated), 
(2) static “directional pixels’ (binary diffraction gratings) that can be 
manufactured at low cost and individually modulated at video rate, 
and (3) edge lighting by standard light-emitting diode (LED) light with 
a compact collimation system. In addition, it can render colour with- 
out the need for colour filters, allowing full transparency of the display 
(with transparent electronics). 

The cornerstone of our display architecture is a set of directional 
grating pixels shown schematically in Fig. 1. The gratings are etched or 
deposited on the backlight surface, and illuminated by collimated light 
that is guided in the backlight substrate by total internal reflection. As a 
result of first-order diffraction, light with input planar momentum 
kin = (Megp 0, 0)2171/A is scattered out of the backlight in a well-defined 
direction characterized by the normalized output vector kout = (1 My, 
nz)2t/A, where: 


Nx = Nee — (Al A) cos (1) 


ny = — (A/A)sing (2) 


Also, n,? + Pe +n =1, neg is the effective index of propagation of 
the input light along the x axis, / is the grating pitch and ¢ is the groove 
orientation with respect to the y axis. A group of directional gratings 
covering all views in the FOV forms a multiview pixel—the basic unit 
cell of our backlight. 

The guided nature of the input light (1.4 > 1) completely suppresses 
specular light in the FOV. It also repels higher diffraction orders away 
from the display normal, ensuring a wide view zone free of ghost 
images. In the Supplementary Information, we prove that the extent 
of the ‘clean’ view zone (measured in terms of the angle « from the 
vertical axis (z axis) is: 


Sind = Neg/2 (3) 


For a glass or plastic backlight, this corresponds to a FOV (2a) of 90° 
(and a high-refractive-index glass'* could be used to reach an FOV 
close to 180°). The guided light illumination is essential for another 
reason: it allows spatial multiplexing of three pixel sets that can be 
selectively addressed by changing the illumination angle. This impor- 
tant feature allows the spatial multiplexing of colour without using 
colour filters, potentially increasing the power efficiency and enabling 
a completely transparent architecture. This illumination technique 
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Figure 1 | Multi-directional backlight concept. a, Scanning electron 
micrograph of grating pixel fabricated by conventional photolithography. 

b, Schematic of the multi-directional backlight under collimated light 
illumination. ‘Directional’ grating pixels scatter the input light into the view 
zone of the display by first-order diffraction. Owing to the large planar 
momentum of the input light, unwanted diffraction orders fall far from the 
display normal. Under illumination by TE (transverse electric) polarized light, 
the output polarization is well preserved, which facilitates the external 
modulation of the backlight using commercial liquid-crystal technology. 0; 


also allows us to create animated sequences of 3D images in a very 
simple way. 

The use of diffractive pixels allows us to define almost arbitrary 
spatio-angular characteristics for the emitted lightfield. For example, 
we can increase the angular density near the normal direction and 
decrease it at sharper view angles. We can increase the number of 
horizontal views and reduce the number of vertical views. We can 
adopt a hexagonal angular grid (as in Fig. 1c) to obtain more uniform 
transitions between views. Unlike most solutions based on geometrical 
optics*, we have complete freedom to define the overlap between 
neighbouring views, allowing us to control the amount of between- 
view leakage and angular aliasing'*'*. More importantly, it permits us 
to mix standard, non-directional pixels (implemented as chirped 
broadband gratings) with directional pixels on the backlight surface 
to enable high-definition two-dimensional 2D content and lower- 
resolution 3D content simultaneously. 

The visual experience provided by a multiview 3D display can be 
characterized by its spatial resolution (multiview pixel size p), and 
angular resolution (between-view distance A@), or number of views 
N= (2sina/A0)*. The maximum distance at which the stereo effect can 
be continuously perceived in the FOV is z3p ~ 6.3cm/A0, where 
6.3. cm is the average human eye separation’®. The depth of field (p/ 
AO) measures the maximum altitude of an image that can be displayed 
at the native resolution p of the display with a disparity of less than one 
pixel between neighbouring views (larger disparity content will show 


Table 1 | Theoretical characteristics of diffraction-limited backlight 
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the angle of propagation of the input light in the backlight, with respect to the 
horizontal plane. c, Full-wave simulation of the radiation pattern from a 64- 
view backlight with 12-\1m-diameter red grating pixels under illumination by a 
collimated LED light source (20 nm spectral width, +3.5° collimation 
approximating our experimental lighting conditions). The upward flux is 
k,|E|’. d, Cross-section of c along the horizontal axis with individual view 
contributions (dotted black lines). The diffraction efficiency for each view is 
about 5% for 100-nm-deep grating grooves. The red arrow indicates the 
presence of a third-order diffracted beam outside the FOV. 


some blurring and ‘jumps’ between views’’). The effective resolu- 
tion at height z (where z = 0 is the surface of the display) is peg(z) = 
max(p, zA@). The maximum number of views that our backlight can 
accommodate is: 


N~ psina/(AB\n) (4) 


where n is the number of spatially multiplexed pixel sets (n = 3 for a 
red-green—blue (RGB) backlight) and f is a geometrical factor that 
depends on the spatio-angular pixel layout (/ = 1.22 for a close-packed 
square array of circular pixels, with a hexagonal distribution of views 
within the FOV and between-view separation corresponding to one 
Rayleigh width). Table 1 illustrates the imaging performance of a full- 
colour backlight with 90° FOV for various pixel sizes. The backlight can 
be improved by incorporating a set of ‘broadband’ pixels to increase the 
resolution of low-disparity content without decreasing the depth of 
field, or by using time-multiplexing schemes to increase the effective 
view number. Such refinements will be the topic of future study. 

In practice, the angular divergence of the views is broadened beyond 
the diffraction limit by the finite spectral and angular distribution 
of the input light. Figure 1c and d presents the simulated radiation 
pattern” of our 64-view backlight prototype, designed with an FOV of 
90° and inter-view separation 0 of 10° to accommodate an input-beam 
spectral width of 20 nm and angular width of +3.5°. The brightness 
variation across the FOV (centred around a diffraction efficiency of 5% 


3D imaging characteristics of a full-colour backlight 


Diffraction-limited performance* 


64-view experimental backlight of Fig. lc and d 


Multiview pixel size, p (mm) 0.25 
Number of views per colour, NV 133 
Angular separation between views, Ad (°) 7.0 
Diameter of each grating sub-pixel, D (tum) 12.5 
Stereo distance, Z3p (cm) 51 
Depth of field (mm) 2.0 
Effective resolution at height 2 cm, Petr (mm) 2.5 


0.5 1.0 0.288 
266 531 64 
5.0 3.5 10 
17.7 25.1 12 
73 103 36 
5.8 16.3 17 
1.7 1.2 3.5 


* Shown are three distinct designs with increasing pixel size, highlighting the trade-off between spatial resolution (pixel size p) and angular resolution (angular separation AQ). 
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Figure 2 | 3D images from a passively modulated backlight. a, b, Portable, 
transparent backlight prototype, with resolution 127 pixels per inch, 200 views, 
wide view zone of 90° (full parallax). The red HP logo in b is animated in a 
‘breathing’ motion by periodically cycling through one of six LED arrays (see 
Supplementary Video). c, Colour dithered 3D image obtained by masking dark 
pixels from a fully covered backlight, using a high-resolution ink-printed 
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Figure 3 | Actively modulated backlight. a, Schematics of a side-illuminated 
backlight actively modulated by an LCD shutter plane. The shutter plane must 
be located close enough to the backlight surface to avoid beam walk-off. 

b, Optical microscope image of the pixel arrangement on the backlight surface. 
Although all three RGB pixels were present on the backlight, we used only the 
red pixels for that prototype. The dotted white squares indicate the alignment of 
the liquid-crystal cell with respect to the pixel array. c, Multiview image 
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pattern. d-f, Steep angle views from the same backlight as c with a test pattern 
(colour ‘snow flakes’) showing 4 cm of rendered depth and colour mixing. Scale 
bar in d refers to e and f as well; scale bar in g refers to h and i as well. We note 
that the elements of the image with highest disparity (located at z = 2 cm) show 
some blurring, owing to the engineered view overlap of the backlight. 

g-i, Turtle 3D image viewed from various viewpoints. 


showing a different number in different view zones. Images corresponding to 
neighbouring views ‘bleed’ into each other owing to the designed view overlap, 
introduced to smooth out between-view transitions. d, Global view of the LCD 
modulated prototype. Red light from an external LED is expanded and 
collimated using free-space optics elements, and linearly polarized before it 
enters the backlight directly from a polished edge. The LCD shutter plane was 
thinned to 50 tm and aligned to the pixel array with an accuracy of about 2 [um. 
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for light diffracted above the grating, for 100-nm grating grooves) and 
the predicted view divergence (about 6.5° full width at half maximum, 
FWHM) compare well with the data reported in the Supplementary 
Information. 

We present a series of prototypes of increased complexity to show- 
case the features of our backlight technology. We fabricate hand-held 
devices featuring a transparent backlight with partial pixel coverage, 
able to project static 3D RGB images or animated sequences of up to six 
monochrome 3D images (90° FOV, 200 um resolution, 200 views per 
colour; see Fig. 2a and b). We also fabricate several 6-inch backlights 
with full pixel coverage (see Fig. 1c and d), on which we overlaid high- 
resolution binary masking patterns (ink on plastic from a high- 
resolution printer) to create larger 64-view images with colour mixing 
and dithering (Fig. 2c-i). 

Finally, we use the front plane of a Sony LCD pico-projector to 
modulate a 1-inch 64-view backlight at a rate of 30 frames per second. 
As shown in Fig. 3, the backlight contains a 720 by 720 array of grating 
pixels, with a pitch of 36 um. Because the gratings in the backlight do 
not significantly change the polarization of horizontally (transverse 
electric) polarized light, only the output polarizer is needed for the 
LCD. Figure 3d shows a multiview image created by the LCD in which 
each view displays a different number from 1 to 64. To avoid ‘beam 
walk-off from the grating pixels to their respective liquid-crystal cell, 
the top glass cover needs to be thinned to about 20 jum. The resulting 
devices were not strong enough for extensive testing, so here we report 
the operation of a stable device featuring a LCD cover thickness of 
about 50 um, allowing the modulation of 14 of the original 64 views 
(the unmodulated views are greyed out on Fig. 3d). The pictures taken 
of the 14 central views show a small amount of between-view mixing, 
in qualitative agreement with the simulation of Fig. 1c and d. At the 
centre of views 12 or 28, for instance, no leakage from neighbouring 
views is discernible. In view 27, the view point leans towards view 26 
and the between-view leakage is apparent. See the Supplementary 
Information for videos of all static and active demonstrations. 

Our multi-directional backlight provides the basis for a very effi- 
cient display, because it does not require colour filters, and can use 
multiple passes of the incident light, much as in a standard LCD. In its 
present form, the intensity of the input light decreases from the edge to 
the centre, which can be compensated in a variety of ways (for 
example, by adjusting scattering strength via groove depth or area 
variation). We are currently working on other pixel modulation meth- 
ods that do not require polarized light and should increase the effi- 
ciency even further. 

We believe that the unique combination of wide-angle 3D perform- 
ance, high spatial resolution, ease of modulation at video rate, compact 
form factor and low manufacturing cost make our multi-directional 
backlight technology very promising for multiview 3D mobile display 
applications. 


METHODS SUMMARY 


The glass backlight prototypes presented in this paper are made of Schott glass 
B270, covered by a 100-nm layer of silicon nitride fabricated by plasma-enhanced 
chemical vapour deposition. Some of the static images were defined by electron- 
beam lithography using a 200-nm PMMA photoresist layer covered by 2 nm of 
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evaporated chrome. The nitride layer was etched in an Oxford reactive ion etching 
system using chlorine chemistry. The 720 X 720 grating pixel array for the LCD 
modulated backlight was defined by a 193-nm deep-ultraviolet photolithography 
process. The finite-difference time domain simulations were performed on a 
Hewlett-Packard-owned computer cluster using the parallel version of MEEP’” 
with 256 simultaneous cores. 
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Changes in global nitrogen cycling during the 


Holocene epoch 


Kendra K. McLauchlan!, Joseph J. Williams', Joseph M. Craine? & Elizabeth S. Jeffers* 


Human activities have doubled the pre-industrial supply of react- 
ive nitrogen on Earth, and future rates of increase are expected to 
accelerate’. Yet little is known about the capacity of the biosphere 
to buffer increased nitrogen influx. Past changes in global eco- 
systems following deglaciation at the end of the Pleistocene epoch 
provide an opportunity to understand better how nitrogen cycling 
in the terrestrial biosphere responded to changes in carbon cycling. 
We analysed published records of stable nitrogen isotopic values 
(8°N) in sediments from 86 lakes on six continents. Here we show 
that the value of sedimentary 5'°N declined from 15,000 years 
before present to 7,056 + 597 years before present, a period of 
increasing atmospheric carbon dioxide concentrations and terres- 
trial carbon accumulation”. Comparison of the nitrogen isotope 
record with concomitant carbon accumulation on land and nitrous 
oxide in the atmosphere suggests millennia of declining nitrogen 
availability in terrestrial ecosystems during the Pleistocene- 
Holocene transition around 11,000 years before present. In con- 
trast, we do not observe a consistent change in global sedimentary 
5'°N values during the past 500 years, despite the potential effects 
of changing temperature and nitrogen influx from anthropogenic 
sources. We propose that the lack of a single response may indicate 
that modern increases in atmospheric carbon dioxide and net 
carbon sequestration in the biosphere have the potential to offset 
recent increased supplies of reactive nitrogen in some ecosystems. 

Humans have drastically altered the global nitrogen (N) cycle by 
doubling rates of biotic and abiotic N, fixation. Consequently, supplies 
of reactive N to many terrestrial ecosystems have increased’, and are 
projected to continue to increase globally by 10-15% within 25 yr (ref. 
4). This increase in reactive N has led to suggestions that Earth has 
crossed a critical boundary regarding the N cycle’. However, despite 
the certainty of increased supply rates, it is unclear whether N avail- 
ability—the supply of N relative to demand by plants and microbes— 
has increased on a global scale. Uncertainty about changes in net 
ecosystem response to increased reactive N hinders predictions of 
the future function of important ecosystem processes, such as primary 
productivity, decomposition and nitrate export to waters. Recent mea- 
surements of N availability on sub-annual to decadal timescales have 
demonstrated conflicting patterns. Although some areas of high depo- 
sition have shown signs of increased N availability’, many terrestrial 
ecosystems have shown a surprising capacity to retain added reactive N 
without evidence of adverse effects, and N availability has apparently 
declined in other ecosystems during the past few decades’. 

Given the uncertainty about changes in terrestrial N availability, 
extending records of N cycling before human manipulation of the N 
cycle would be useful. A variety of empirical correlations® and theor- 
etical models’ indicate that the variation in past N cycling can be 
captured in the 8N values of leaves!®, wood"!, soils!? and sediments'*. 
Of these, lacustrine sediments provide the longest-term reconstruc- 
tions and integrate over the largest area, although they also integrate 
over the greatest number of processes that can affect the ultimate 
isotopic values. The basic premise is that 8'°N measured in organic 


material integrates fractionation processes, such that 5'°N values pro- 
vide an index of the processes in the N cycle that discriminate against 
*N, particularly gaseous loss pathways'’. When N supply is high 
relative to biotic demand, N is lost through fractionating pathways 
and the remaining ecosystem N is enriched in '°N. In the few studies 
that have measured several ecosystem compartments, values of ter- 
restrial 5'°N, stream 5'°N in NO; , and lacustrine sedimentary 5>N 
were correlated’*"*. 

To better understand the patterns of N cycling over century to 
multi-millennial timescales, we compiled published records of sedi- 
mentary 5'°N from 86 lakes distributed on six continents to assess 
changes in ecosystem-level N cycling over the past 15,000 yr (Fig. 1). 
This time period encompasses the Pleistocene—Holocene transition—a 
period of rapid warming, atmospheric CO, increases, and primary 
succession in newly deglaciated landscapes—and also includes the past 
500 yr during which there have been unprecedented anthropogenic 
global changes. For inclusion in this analysis, sites had to meet mini- 
mum criteria for sampling density and duration of sedimentary record, 
leading to sample sizes of 37 sites for the analysis from 15,000 years 
before present, and 58 sites for the 500-year analysis. The original 
chronology of each study was reassessed and expressed in calendar 
years before present (cal. yr BP), where present (0 cal. yr Bp) is defined 
as AD 1950. Sites are distributed broadly over the globe, representing a 
variety of climates, biomes and degrees of human impact (Fig. 1, 
Supplementary Fig. 3). Modern mean annual precipitation at these 
sites ranges from 118 to 2,516mm and mean annual temperature 
ranges from —17 to 26 °C. 

During the Pleistocene-Holocene transition when global terrestrial 
ecosystems were accumulating C, sedimentary 5'°N declined globally 
for approximately 8,000 yr (Fig. 2a). From 15,000 cal. yr Bp to an inflec- 
tion point at 7,056 + 597 cal. yr Bp, sedimentary SDN declined at a rate 
of 0.25 + 0.03% kyr’ (P<0.001). Of the 17 sites with declining 
trends during this period, 11 were significant, whereas only 3 of the 
10 increasing trends were significant. The sites with declining records 
were located in both tropical and temperate areas and both hemi- 
spheres; some had been glaciated and some had not. Since the mini- 
mum value at 7,050cal.yrsp, global sedimentary 5'°N has been 
increasing at a gradual rate of 0.08 + 0.04%okyr ' (P< 0.05). This is 
32% of the rate of decline during the previous 8,000 yr. 

Despite the general trend for increasing sedimentary 5'°N over the 
past 7,000 yr, high-resolution modern sedimentary 5'°N records show 
no evidence of consistent changes in global N cycling during the past 
500 yr (Fig. 2b). Analysis of a larger set of lakes with higher sampling 
density, during the time window from AD 1500 to AD 2005, shows that 
sedimentary 5'°N has not been significantly changing during recent 
times (n = 58, P = 0.2). Only 28 of the 58 sites in the analysis of the past 
500 yr have sedimentary 5’°N records that show a significant increase 
or decrease over time. Sedimentary 5’°N has significantly increased at 
10 sites and significantly decreased at 18 sites since AD 1500. 

Regression analysis suggests that both climatic and anthropo- 
genic variables affected the direction of change in sedimentary 8'°N 
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Figure 1 | Locations of the 86 sites analysed for lacustrine sedimentary 5°N 
in this study. Red indicates a sedimentary record that was analysed only for the 
past 500 yr, yellow indicates a record used for the 15,000 cal. yr BP analysis, and 


at individual sites during the past 500 yr (Fig. 3). Overall, lakes from 
warm, low-elevation regions are more likely to have increasing values 
of sedimentary 5'°N towards the present, while those from cold, high- 
elevation regions are more likely to have decreasing values of sedi- 
mentary 5'°N towards the present. For example, the sediments from a 
typical lake at 3,000 m where mean annual temperature was — 10°C 
would have decreased in 5'°N by 1.3% per century for the past 500 yr, 
while sediment from lakes at sea level where mean annual temperature 
was 25 °C would have increased by 1.5%o per century over the same 
time period. Anthropogenic factors also influence observed trends of 
sedimentary 5°N. Lakes in regions with a large human footprint—as 
calculated from population density, land transformation, accessibility 
and electrical power infrastructure'°—are more likely to have sedi- 
mentary 5’°N that decreased towards the present. 

The global patterns of sedimentary 5'°N during the past 15,000 yr 
are consistent with a strong influence of terrestrial C sequestration on 
the global N cycle. The prolonged retreat of continental ice sheets in 
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Figure 2 | Changes in lacustrine sedimentary 5'°N during the late 
Pleistocene and Holocene. a, A smoothing spline curve (0.05 smoothing 
parameter) fitted to the means of sites in 100-yr bins is shown (red) with 95% 
bootstrapped confidence intervals (grey). Declines in sedimentary 8'°N from 
15,000 cal. yr BP to the breakpoint at 7,056 + 597 cal. yr BP correspond with 
periods of global net terrestrial carbon gain (shaded green)*. Dotted black line is 
the breakpoint regression. b, A different set of high-resolution sedimentary 
5'°N records shows no net change over the past 500 yr. 


orange indicates a site that met analytical criteria for both time periods. Sites 
with published sedimentary 5'°N records that were not included in this study 
are coloured white. Full site information is available in Supplementary Table 1. 


the Northern Hemisphere during the Pleistocene-Holocene transi- 
tion exposed approximately 5 X 10'* m° of land area’ that accumu- 
lated C and N after colonization by plants. During the early Holocene, 
~600 Pg of C were sequestered in terrestrial ecosystems’’, with strong 
net terrestrial C accumulation from 15,000 to 7,000 cal. yr BP (ref. 2). 
The N budgets that accompanied this global C sequestration are 
almost completely unknown”, although several local- or regional-scale 
studies provide estimates of rapid rates of N accumulation associated 
with primary succession’*’’. On the global scale, if the additional 
organic matter had a C:N ratio of 100 (ref. 14) it would have required 
6,000 Tg of N to accumulate the organic matter from the Last Glacial 
Maximum to pre-industrial Holocene. 

Two underlying mechanisms associated with terrestrial C sequest- 
ration could be contributing to the broad declines in sedimentary 5'°N 
seen from 15,000 to 7,000 cal. yr Bp (Fig. 4). First, increased N> fixation 
would add N to local terrestrial ecosystems with an isotopic signature 
initially near 0%o, thereby potentially reducing sedimentary 5'°N 
values” despite increasing terrestrial N supplies. Under equilibrium 
conditions, the addition of fixed N often leads to increases in ecosys- 
tem 8’°N by increasing N supply and fractionating losses”’; however, 
if ecosystems are accumulating and sequestering C then N> fixation 
can reduce ecosystem 5'°N on century to millennial timescales”. 
Whether N>-fixation during the initial stages of primary succession 
after deglaciation increased or decreased ecosystem 5'°N at individual 
sites is an open question; however, we found that sites which were not 
glaciated during the Last Glacial Maximum also exhibited declines 
in sedimentary 5'°N from 15,000 to 7,000 cal. yr Bp, thus additional 
mechanisms were probably driving the decline in 5'°N values. 

Alternatively, C sequestration during the early Holocene could have 
reduced local terrestrial N availability, which would be reflected in 
lower values of ecosystem 3!°N (that is, foliage and soil) and of the 
N transported to lakes. The proposed hypothesis that the 8'°N of 
lacustrine sediments reflects reduced N availability is further sup- 
ported by observed declines in atmospheric concentrations of N,O 
during the early Holocene (Supplementary Fig. 7). N2O is produced 
by the fractionating gaseous loss pathways of nitrification and denit- 
rification in soils with high N supply relative to demand. The Holocene 
minimum of atmospheric NO occurs between 8,000 and 6,000 cal. yr BP 
(ref. 23; Supplementary Fig. 7), temporally coincident with the mini- 
mum in sedimentary 5'°N, and it has potentially been linked to 
reduced terrestrial NO production in soils. 
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Figure 3 | Three local environmental variables explain site-specific 
trajectories of sedimentary 5'°N in the past 500 yr. a-c, Partial residual plots 
are shown between mean annual temperature (MAT; partial r=0.11, 
P<0.005; a), elevation (partial r = 0.08, P< 0.01; b) and a human footprint 
index (partial 1” = 0.1, P<.0.005; c) and the slope of site-specific regressions 


Any reduction in N availability due to net C sequestration probably 
would have been amplified by increasing atmospheric CO, concentra- 
tions during the Pleistocene-Holocene transition. Atmospheric CO 
concentrations increased from 188 p.p.m.v. at ~15,000 cal. yr BP to 
260 p.p.m.v. by 10,000 cal. yr BP (ref. 24). This sustained increase in 
atmospheric CO, could have generated progressive nitrogen limita- 
tion” in plants as their demand for N increased with the higher levels 
of productivity and N was sequestered (along with C) in plant biomass 
and soil organic matter pools. This could have contributed to lowered 
N availability in terrestrial ecosystems for thousands of years. The 
effects of local increases in N> fixation and reduced N availability are 
not mutually exclusive, and in fact the processes are linked. Low N 


Figure 4 | Conceptual diagram of hypothesized drivers of isotopic 
signatures of sedimentary N. a, During the early Holocene; b, modern times. 
Grey arrows show processes that lower the 8'°N of products relative to inputs; 
white arrows show processes that do not alter 5'°N of products. The isotopic 
signature of N as it is fixed from the atmosphere (N> fix) is close to 0%. C 
sequestration (C,.q) and increased atmospheric CO, concentrations could have 
lowered terrestrial N availability (—N,yai1) that would have led to fluxes of N to 
the atmosphere and lakes relatively depleted in '"N. The quantity of modern 
anthropogenic N addition is probably large enough to maintain similar N 
availability (=Nayail) to ecosystems on a global scale. 
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between sedimentary 5'°N (5'°N,eq) and cal. yr BP for the past 500 yr at 58 lakes. 
Sites with declining sedimentary 5'°N values towards present have low MAT, 
high elevation and high human footprint index. Statistical results for the overall 
model are shown in Supplementary Table 2. 


availability can stimulate N> fixation’”®. Both processes could affect 
the signatures of sedimentary 5'°N on a global scale by affecting the 
signature of gaseous N losses and atmospheric reactive N. For example, 
the 8!°N of nitrate deposited on the Greenland Ice Sheet declined from 
15 to 7 cal. kyr bp (ref. 27). In addition to the direct effects of increased 
atmospheric CO, on local N cycling, these teleconnections could also 
contribute to declines in sedimentary 8'°N in non-glaciated lakes such 
as those in eastern Africa. 

The increased supply of reactive N to terrestrial ecosystems that 
accompanied the modern doubling of reactive-N production has been 
the major change in the global N cycle during the past 500 yr (ref. 3). 
One might expect an increase in sedimentary 5'°N if demand by the 
terrestrial biosphere could not keep pace with the increased supply 
of reactive N. Yet, global increases in sedimentary 5'°N during the past 
500 yr in this study were not observed. One potential explana- 
tion for this might be that the modern terrestrial biosphere has 
been sequestering C (~4PgCyr '; ref. 28), as occurred during the 
Pleistocene-Holocene transition. Accompanying this C sequestration, 
N sequestration would be approximately 36 TgN yr_', depending on 
the ecosystem C:N ratio'*. Although this rate of N sequestration is 
~20% of the ~185TgNyr ‘ that humans add to the biosphere’, 
terrestrial C sequestration and increases in atmospheric CO, concen- 
trations could be limiting wide-scale increases in N availability. 
Alternatively, the introduction of large amounts of reactive N with 
an atmospheric signature could be balancing the isotopic effects of 
fractionation due to gaseous losses. The lack of a globally consistent 
trend in sedimentary 5'°N over the past 500 yr suggests that the ulti- 
mate outcome of these processes is being primarily controlled by local 
conditions. This multiplicity of responses highlights the need to under- 
stand local and regional trends in sedimentary 3?N (ref. 29), in parti- 
cular to reduce uncertainties in local and global controls over the N 
budget and to trace the fate of deposited reactive N in terrestrial and 
aquatic ecosystems. 

Our results add to the growing evidence that N supply and N 
demand must be considered jointly to understand the N status of 
the terrestrial biosphere. Global-scale N cycling changed long before 
humans significantly modified the N cycle, therefore any recent 
changes to the N cycle should not necessarily be interpreted as a direct 
effect of increased anthropogenic reactive-N production. Global bio- 
geochemical cycles responded slowly to massive changes in planetary 
climate systems; it took millennia for the global N cycle to adjust to the 
removal of ice sheets and reorganization of the global C cycle. The 
dynamics of these slow processes, especially the accuracy of C accrual 
estimates on millennial timescales, can be studied through parameteri- 
zation of C and N cycling in global vegetation and ecosystem models 
constrained by N isotopic data*®. Overall, these data indicate an 
important role of net terrestrial C balance in maintaining the nutrient 
status of the modern biosphere. 
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METHODS SUMMARY 


Data compilation. Published data on lacustrine sedimentary 5'°N were obtained 
from the corresponding author or directly from the publication. Chronologies 
were constructed using both 7!°Pb and '*C dating approaches with other fixed 
date horizons such as tephra. Raw radiocarbon ages were recalibrated and new age 
models were derived. The isotopic ratio of N (8'°N) is expressed relative to the 
standard of atmospheric No. 

Sedimentary 5'°N over time. Sedimentary 5'°N data from each site were grouped 
into bins to standardize sampling density (Supplementary Fig. 5). For the past 
500-yr time window, sites had to meet a minimum number of 8 sample points and 
a duration of at least 75 yr (n = 58 sites). Bin size was 10 yr. For the 15,000 yr Bp 
time window, bin size was 100 yr and site criteria included a minimum of 10 
sedimentary 5'°N samples in 10 separate bins and a 1,000 yr duration (n = 37). 
Nine sites met criteria for both analyses (total n = 86). 95% confidence intervals 
for mean values of sedimentary 8'°N were calculated by bootstrapping over 1,000 
replications, with samples replaced. Breakpoint analysis was performed to identify 
inflection points. 

Site characteristics that influence recent trajectories. Linear regressions 
between sediment age (cal. yr BP) and untransformed bulk sedimentary 5'°N were 
performed for each site for the most recent 500 yr. Then, a stepwise forward 
multiple regression was performed to test the roles of site-specific variables in 
predicting the direction and rates of change over time among sites. Tested pre- 
dictor variables included elevation (derived from the publication or a global eleva- 
tion model), modelled historic and modern atmospheric N deposition levels®, 
mean annual temperature and mean annual precipitation, and a calculated 
human footprint index ranging from 0 (low impact) to 100 (high impact)!*. 
Note that these variables reflect modern, not past, site characteristics. Variables 
were retained with a significance level of «<0.05. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Literature search. The search was conducted using the Web of Science database 
with keywords ‘nitrogen isotopes’ and ‘lake sediments’, ‘delta (15)N’ and ‘lake 
sediments’, and ‘paleo’ or ‘palaeo’ and ‘nitrogen’ in August 2011. Of the 627 
records returned as a result of these searches, 192 met the criteria of providing 
well-dated stable isotope data from lacustrine sediments. All studies reported bulk 
sedimentary 5'°N data expressed in standard delta notation relative to the isotopic 
composition of air’'. Corresponding authors were contacted with a request for the 
data published in each manuscript, and in cases where an author could not be 
contacted and/or the original data files could not be located, the published data 
(sedimentary 5'°N and chronology) were obtained from the manuscript. In the 
case of duplicate cores from a single lake, the manuscript was examined and the 
oldest single core was chosen. This process led to a sample size of 98 sites. If a 
manuscript mentioned acid pre-treatment of sediments before mass spectrometry, 
that site was excluded due to possible biases”. Further criteria were applied 
regarding temporal density of sampling to meet analytical requirements, such that 
the final number of sites used in this analysis was 86 lacustrine sediment records. 
Site information is available in Supplementary Table 1 and these references** **. 

Establishment of chronologies. To standardize age models for all sedimentary 
records, recalibration of existing radiometric ages was performed where possible. 
Radiocarbon ages were calibrated using the most recently produced calibration 
curves. SHCal04”” was used for Southern Hemisphere ages 0-11 cal. kyr Bp. 
IntCal09'” was used for all Northern Hemisphere records and Southern Hemi- 
sphere ages older than 11 cal. kyr Bp. For those sites where a new chronology was 
created, classical age-depth models were generated using CLAM"! run in the 
open-source statistical software R'°’. Several age-depth models were created 
for each site and the choice of model used was based upon the goodness of fit, 
the recommendation from the original published manuscripts and the interpreta- 
tion of the authors. Chronologies spanning the past 500 yr were often based 
partially or entirely on 7!°Pb activity using a constant rate-of-supply model. 
These ages were not reassessed, which might in the case of imprecise original 
chronologies affect the coherence of the higher-resolution ‘short cores’. In order 
to standardize ages and to use the most accurate chronological constraint possible, 
all ages are given as years before present (cal. yr BP) in which ‘0’ refers to AD 1950. 
Any ages more recent than this date are displayed as negative values (that is, AD 
2012 = —62 cal. yr BP). 

Site characteristics and additional variables. Latitude and longitude of each 
lake were acquired from the published manuscripts, as well as the surface area 
and elevation when provided. Two climate variables—Biol (mean annual tem- 
perature) and Biol2 (mean annual precipitation)—were obtained from the 
WorldClim data set which is presented at 1km spatial resolution and averages 
AD 1950 to 2000 (http://www.worldclim.org/)'®*. Estimates of nitrogen deposited 
to terrestrial ecosystems from the atmosphere in 1993 and 1860 were obtained from 
(http://daac.ornl.gov/CLIMATE/guides/global_N_deposition_maps.html)'™*. Ele- 
vation, where not provided in manuscripts, was obtained from a global elevation 
model’. The human footprint metric is an index of degree of human in- 
fluence on terrestrial ecosystems, calculated from data on population density, 
land transformation, accessibility and electrical power infrastructure. It ranges 
from 0 to 100 (ref. 106). 

Additional geochemical data from the sediment records—N concentrations, 
C concentrations, 8'°C and C:N—do not correlate strongly at a global scale to 
sedimentary 5'°N values or the rate or direction of change over time (Sup- 
plementary Fig. 4). There are relationships among geochemical variables at indi- 
vidual sites”!°”. 


Statistical analyses. Because sedimentary 5'°N is already measured relative to 
a global standard, minimal data manipulation was required to standardize the 
records. Two time windows were selected for analysis: (1) past 500 yr when 
atmospheric CO, concentrations have increased by 100 p.p.m.v. (from 290 to 
390 p.p.m.y.)'°*, and (2) the past 15,000 yr. While some sediment records were 
older than 15,000 yr, the data density began to degrade significantly before this 
time. The latter time period represents an increase in atmospheric CO? approxi- 
mately 50% of the recent increase, from 188.5 p.p.m.v. at 17,073 cal. yr BP gas age to 
238.6 p.p.m.v. at 13,440 cal. yr BP gas age’. 

To determine the pattern of sedimentary 5'°N over time, we performed piece- 
wise linear regressions using the ‘segmented’ package in R''®. The existence of an 
inflection point, and a significant difference in slope, was tested using the Davies’ 
test. The position, along with standard error and confidence intervals, of the 
inflection point was estimated (Fig. 2). Confidence intervals of 95% levels 
(Fig. 2) were generated by a bootstrap approach in which values for individual 
sites per bin window were sampled with replacement over 1,000 replications. 
The bootstrap approach allows assessment of the sensitivity of the results to the 
inclusion or exclusion of individual sites. The bootstrapping was performed in R, 
using the package ‘Boot’!"’. 
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Melt-rich channel observed at the 
lithosphere-asthenosphere boundary 


S. Naif!, K. Key’, S. Constable! & R. L. Evans? 


The lithosphere-asthenosphere boundary (LAB) separates rigid 
oceanic plates from the underlying warm ductile asthenosphere. 
Although a viscosity decrease beneath this boundary is essential for 
plate tectonics, a consensus on its origin remains elusive. Seismic 
studies identify a prominent velocity discontinuity at depths 
thought to coincide with the LAB but disagree on its cause’, 
generally invoking either partial melting® or a mantle dehydration 
boundary’ as explanations. Here we use sea-floor magnetotelluric 
data to image the electrical conductivity of the LAB beneath the 
edge of the Cocos plate at the Middle America trench offshore of 
Nicaragua. Underneath the resistive oceanic lithosphere, the mag- 
netotelluric data reveal a high-conductivity layer confined to 
depths of 45 to 70 kilometres. Because partial melts are stable at 
these depths in a warm damp mantle’, we interpret the conductor 
to be a partially molten layer capped by an impermeable frozen lid 
that is the base of the lithosphere. A conductivity anisotropy 
parallel to plate motion indicates that this melt has been sheared 
into flow-aligned tube-like structures’. We infer that the LAB 
beneath young plates consists of a thin, partially molten, channel 
of low viscosity that acts to decouple the overlying brittle litho- 
sphere from the deeper convecting mantle. Because this boundary 
layer has the potential to behave as a lubricant to plate motion, its 
proximity to the trench may have implications for subduction 
dynamics. 

The passive magnetotelluric method uses naturally occurring mag- 
netic fields and induced electric fields to probe mantle electrical con- 
ductivity structure. Because pure melts are at least two orders of 
magnitude more conductive than a typical peridotite, mantle conduc- 
tivity can be greatly increased by small amounts of partial melt’®, 
whereas conductivity depends to a much lesser extent on mantle tem- 
perature and hydration state’’. This large conductivity contrast makes 
magnetotelluric soundings highly sensitive to the presence of mantle 
partial melts. 

We deployed 50 broadband electromagnetic stations across a single 
280-km-long profile that extended from the Cocos plate abyssal plain, 
across the trench, and onto the continental shelf, amounting to the 
largest subduction zone electromagnetic deployment to date (Fig. 1). 
Data from high-frequency controlled-source electromagnetic profiling 
revealed that the pervasive extensional bending faults on the trench 
outer rise are porous channels that allow for deep crustal hydration 
and serpentinization of the uppermost mantle’. The lower-frequency 
magnetotelluric data considered here target deeper conductivity struc- 
ture in the lithosphere and asthenosphere. We extended the instru- 
ment array far onto the abyssal plain to image what we expected to be 
unremarkable conductivity beneath the oceanic plate, which would 
provide a baseline for comparisons with conductivity features found 
beneath the trench axis and continental margin. 

We converted the observed magnetotelluric responses into a two- 
dimensional conductivity model using a nonlinear regularized inversion 
that solved for the triaxially anisotropic conductivity tensor of 10,000 
mesh cells'*"*, Figure 2a shows the horizontal resistivity (reciprocal of 


conductivity) component aligned in the direction of plate motion. All 
three tensor components are shown in Supplementary Fig. 3. 

Landward of the trench, the magnetotelluric data reveal a resistive 
subducting slab and mantle, with a notable correlation between the 
location of earthquakes and high resistivity (Fig. 2a). The widespread 
distribution of earthquakes indicates that the brittle slab is rupturing 
over a wide depth range (rather than the rupturing being concentrated 
along the plate interface), consistent with high resistivity indicating a 
relatively cold, fluid-free mantle. 

The conductivity becomes laterally uniform westward of the trench. 
A veneer of low-resistivity sediments and extrusive volcanics overlie a 
highly resistive lithosphere, typical of oceanic plate structure’. Below 
this, we find an anomalously conductive, horizontally extensive layer 
of 4-6Qm at 45-70km depth. This conductor extends at least to 
the western edge of the profile where our magnetotelluric array ends, 
while its eastern edge lies beneath the trench outer rise wall. The 
anomalous layer is 1.5-2 times more conductive in the direction 
parallel to plate motion than parallel to the trench axis (Fig. 2b.) The 
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Figure 1 | Regional tectonic map and location of the magnetotelluric 
survey. The 280-km magnetotelluric profile (boxed) crosses the Middle 
America trench offshore of Nicaragua, where 23-24-Myr-old Cocos plate 
subducts beneath the Caribbean plate at a rate of 85 mmyr_ |. The black dashed 
line separates the portions of the Cocos plate produced by either the East Pacific 
Rise (EPR) or the Cocos-Nazca spreading centre (CNS). The 50 sea-floor 
magnetotelluric stations were deployed at 10- and 4-km intervals in water 
depths ranging from 60 to 5,150 m and recorded magnetotelluric signals for 
19 days during the month-long survey. 
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Figure 2 | Resistivity model obtained from anisotropic inversion of the sea- 
floor magnetotelluric data. At the top is the surface view; arrows show the 
direction of north and p, (see below), and inverted triangles denote seafloor 
magnetotelluric station locations. a, The electrical resistivity in the direction 
parallel to plate motion (p,). The colour scale gives log[p, (Q m)], with blue and 
red colours corresponding to resistive and conductive (less resistive) features, 
respectively. The dark red line is a model of the top of the subducting slab””. 
Earthquake hypocentres from up to 50 km off-axis are shown as black circles 
(from the USGS/NEIC catalogue). The region enclosed by the dashed black line 
is where the model is at least 1.5 times more conductive in the direction parallel 
to plate motion. b, Resistivity ratio for the plate-motion-parallel (py) to trench- 
axis-parallel (p,.) model components. The colour scale gives log(p,/p,), and 
the plot shows the strong anisotropy of the conductive layer at 45-70 km depth 
(red regions >150 km offshore). Although the lithosphere above shows a 
strong anisotropy, we warn that this is not well constrained, because the 
magnetotelluric method is primarily sensitive to conductive rather than 
resistive features *°. The deeper mantle beneath the conductive layer is isotropic, 
suggesting it is not being sheared. 


deeper asthenosphere is isotropic, with resistivity 10-20 Qm. Model 
sensitivity studies indicate that the conductance (the product of con- 
ductivity and thickness) of the anisotropic conductive layer and the 
conductivity of the underlying isotropic asthenosphere are well con- 
strained by the data (Supplementary Fig. 4). Because the magnetotel- 
luric data are primarily sensitive to the layer conductance, the response 
of the observed anomalous layer, which is 25 km thick, is nearly equi- 
valent to that from a 12.5 km layer with a conductivity twice as large; 
however, we favour the thicker layer found by the inversion because its 
higher resistivity requires less partial melt to explain it. 

The anomalous layer is too conductive to be consistent with localized 
temperature variations, but could be explained by either a hydrated 
mantle or a small degree of partial melt. Mantle olivine with a water 
content of 800 parts per million (p.p.m.) has a similar conductivity to 
the observed layer’? (Fig. 3a), but such a high water content would lead 
to significant partial melting over this depth range’® (Fig. 3c). This leads 
us to conclude that the conductive layer must contain partial melt, 
arising from a mantle with a much lower degree of hydration. 

Assuming a 23-Myr-old oceanic lithosphere with a mantle potential 
temperature of 1,420 °C, a partially molten peridotite matrix is stable 
beneath our profile (Fig. 3c)'*'”. The top of the melting region at 45 km 
depth agrees with the intersection of the 23-Myr-old oceanic plate 
geotherm and a damp solidus containing 275 + 85 p.p.m. H,O, sig- 
nifying the minimum water content required for the stability of melt 
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to coincide with our observed high-conductivity layer. This intersec- 
tion may culminate in a freezing front where melt solidifies owing to 
the colder temperatures above. We infer that this freezing front forms a 
permeability barrier that traps buoyant melt beneath, rather than 
allowing it to percolate to shallower depths. This barrier is further 
reinforced by the higher viscosity of the uppermost mantle above; 
this higher viscosity occurs because the uppermost mantle has been 
depleted of volatiles during upwelling and melting at the mid-ocean 
ridge’*. This upper boundary may also represent a sharp increase in the 
solubility of water in mantle minerals, resulting in a corresponding 
sharp decrease in the stability of partial melt’. 

Estimates derived from samples of mid-ocean ridge basalts give 
mantle H,O contents of 50-200 p.p.m., which are too low for the 
275 p.p.m. H,O needed to sustain partial melts at 45km depth. 
However, uncertainties in partition coefficient measurements may 
accommodate this discrepancy; the resulting statistical lower bound of 
190 p.p.m. H,O is within the range of estimates from mid-ocean-ridge 
basalts. We do not consider the effects that other volatiles (specifically, 
CO;) have on the solidus, which could further reduce the concentra- 
tion of water required to induce silicate melts at 45-70-km depth®. 

The electrical anisotropy indicates that the melt is made up of a 
network of tubes or elongated spheroids aligned in the direction of 
plate motion’. Hence, we estimate the amount of partial melt present 
by using a parallel mixing model appropriate for anisotropic conduc- 
tivity, together with results of recent laboratory studies of hydrated 
basaltic melt conductivity’, yielding melt fractions of 1.0-2.1% for a 
mantle containing 275 p.p.m. H,O (Fig. 3b). Further reductions in 
melt estimates are possible, but require water concentrations that 
may be unrealistically high. For instance, 0.3% melt is attained for a 
mantle with 720 p.p.m. H2O. Likewise, carbonatite melt is significantly 
more conductive and thus yields even smaller fractions, but is an 
unlikely explanation because it is unstable at these depths®. Bulk con- 
ductivity in the deeper isotropic asthenosphere is best described by the 
Hashin-Shtrikman upper bound, which predicts well-connected melt 
fractions of 0.2-1.1% for the observed 10-20 Q m mantle. 

Deep off-axis melt emplacement can occur during lithosphere 
formation at the ridge or result from accumulation of a small degree 
of intraplate melting®. An earlier magnetotelluric experiment at the 
southern East Pacific Rise spreading ridge observed an off-axis con- 
ductive asthenosphere at 60-120 km depth beneath a resistive litho- 
sphere**'. We interpret that conductive layer as requiring partial melt, 
because its conductivity and anisotropy are similar to observations 
beneath our profile (Fig. 3b). That portion of the ridge generates the 
Nazca plate; if we assume equivalent ridge processes are occurring for 
the Cocos plate, where our profile was measured, then most of the 
hydrous melt has been emplaced at or near the ridge axis. Additional 
melt may accumulate through deeper intraplate melting—for example, 
from small-scale convection”. Grain boundary migration of deeper 
partial melt** carries with it increased water content as a result of 
the preferential partitioning of water into silicate melts. As enriched 
hydrated melt rises, it collects beneath the colder, less permeable litho- 
sphere™ and shears into a network of interconnected horizontally 
aligned melt bands”, possibly driven by large-scale asthenospheric 
flow’®. Stress gradients perpendicular to the shear direction have 
been shown to reduce the melt connectivity in the direction along 
the gradient’, offering a possible mechanism for our observed aniso- 
tropy. The depth extent over which the asthenosphere is being sheared 
can be inferred from the anisotropy of the melt layer along with its 
increased conductivity, which the magnetotelluric data constrain to a 
maximum thickness of 30 km. 

The existence of a horizontally extensive melt layer that is being 
sheared over a confined depth interval indicates the LAB is a thin, low- 
viscosity channel. Theoretical studies suggest that even small melt 
fractions (<1%) can lower viscosity by up to two orders of magnitude, 
effectively decoupling the lithosphere from the asthenosphere”’. The 
depth extent over which this decoupling occurs as well as its proximity 
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Figure 3 | High asthenosphere conductivity explained by a thin partially 
molten layer. Evidence for the stability of melt. a, Hydrous olivine resistivity is 
estimated as a function of H,O content for different temperatures'’. The dark 
grey region represents the observed resistivity (4—6 Q m) of the anomalous 
conductive layer, while the lighter grey region represents a 10 Q m conductive 
layer observed off-axis at the southern EPR’’. Olivine would be required to 
contain at least 800 p.p.m. H,O to account for the observed resistivity ifno melt 
is present. b, Bulk resistivity of partial melt shown as a function of melt fraction 
for different temperatures and bulk mantle water contents'® with an assumed 


to the trench axis have potentially new implications for plate dynamics, 
as previous studies have focused on the effects of melt at the ridge axis 
and mantle wedge. A pervasive feature that exists beneath oceanic 
plates at large distances from spreading centres requires the stability 
of a partially molten layer, as dictated by the solidus of wet peridotite 
and a warm mantle geotherm. A previous magnetotelluric study of 
140-150-Myr-old lithosphere in the Pacific Ocean basin did not find a 
conductive melt layer’’. This is consistent with a thicker cold plate 
containing too little water to sustain hydrous melts at the greater 
LAB depth. 

In our profile, the conductive layer ends near the trench, whereas the 
layer anisotropy persists in a trajectory that follows the subducting 
slab, albeit with a decreasing anisotropic factor. This signifies that as 
the plate subducts, some melt is likely to remain in place beneath the 
shallower LAB at the trench outer rise owing to its buoyancy. This 
offers an explanation in which the observed melt-rich LAB is caused by 
the concentration of low-fraction melts at the trench over time as the 
plate continues to be subducted; if this mechanism is responsible for 
locally enriching the asthenosphere with melt, we predict that melt 
would be most concentrated near the trench and decrease towards 
younger regions of the plate. 


METHODS SUMMARY 


We processed the magnetotelluric data using a standard robust multi-station 
impedance estimation approach at periods of 22-32,000s (ref. 29). Supplemen- 
tary Fig. 1 presents the data and two-dimensional inversion model fits. The data 
subset suitable for two-dimensional modelling was evaluated by inspecting impe- 
dance polarization diagrams (Supplementary Fig. 2). Sites S31-S34, located on 
the margin slope near the base of the trench, display strong three-dimensional 
effects and were omitted entirely. The rest of the data exhibit a profile-wide 
rotation in strike angle at periods greater than 1,024s that is incompatible with 
two-dimensional modelling and were also omitted. 

The two-dimensional triaxially anisotropic inversion model was obtained using 
a standard inversion approach implemented using a new parallel adaptive finite 
element algorithm’’. In addition to penalizing spatial variations in the model 
resistivity, the inversion included a penalty to minimize the amount of anisotropy 
produced in the inverted model. The data uncertainty was subject to a 10% error 
floor, with the inversion finding a smooth model with a root-mean-square (r.m.s.) 
misfit of 1.0 for the combined TE and TM mode data. Supplementary Fig. 3 shows 
the resistivity obtained for each coordinate axis. 

Three tests were performed to investigate the sensitivity of the model to the 
interpreted features (Supplementary Fig. 4). All attempts significantly increased 
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mineral/melt partition coefficient of 0.006. ¢, Solid lines depict the solidus of dry 
and wet peridotite for various mantle H,O contents'*. The dashed lines show 
the geotherms derived from a plate cooling model of 23-Myr-old oceanic 
lithosphere’. Melt is stable at depths greater than 45 km for peridotite with 
275 + 85 p.p.m. H,O anda 1,420 °C mantle potential temperature. A geotherm 
with a reduced mantle potential temperature of 1,315 °C requires 

505 + 155 p.p.m. H,O. The solidus of wet peridotite is calculated with the 
cryoscopic approximation assuming an oxide molar mass"®. 


the r.m.s. misfit, confirming the strong data sensitivity to the interpreted features 
of the converged model. 

The solidus of wet peridotite was derived using the cryoscopic approximation 
with a molar enthalpy of fusion formulation (as described in ref. 16). The mole 
fraction of water in the melt was calculated with an oxide molar unit (59 g mol — YY, 
The solidi are compared to geotherms calculated from a plate cooling model that 
has a 95-km-thick plate with a 1,420 °C or 1,315 °C mantle potential temperature 
for 23-Myr-old oceanic plate’’. 
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The genomic signature of dog domestication reveals 
adaptation to a starch-rich diet 


Erik Axelsson', Abhirami Ratnakumar’, Maja-Louise Arendt’, Khurram Maqbool!, Matthew T. Webster’, Michele Perloski’, 
Olof Liberg*, Jon M. Arnemo*®, Ake Hedhammar® & Kerstin Lindblad-Toh? 


The domestication of dogs was an important episode in the 
development of human civilization. The precise timing and loca- 
tion of this event is debated” and little is known about the genetic 
changes that accompanied the transformation of ancient wolves 
into domestic dogs. Here we conduct whole-genome resequencing 
of dogs and wolves to identify 3.8 million genetic variants used to 
identify 36 genomic regions that probably represent targets for 
selection during dog domestication. Nineteen of these regions con- 
tain genes important in brain function, eight of which belong to 
nervous system development pathways and potentially underlie 
behavioural changes central to dog domestication®. Ten genes with 
key roles in starch digestion and fat metabolism also show signals 
of selection. We identify candidate mutations in key genes and 
provide functional support for an increased starch digestion in 
dogs relative to wolves. Our results indicate that novel adaptations 
allowing the early ancestors of modern dogs to thrive on a diet rich 
in starch, relative to the carnivorous diet of wolves, constituted a 
crucial step in the early domestication of dogs. 

Domestic animals are crucial to modern human society, and it is likely 
that the first animal to be domesticated was the dog. Claims of early, 
fossilised dog remains include a 33,000-year-old doglike canid from 
the Altai Mountains in Siberia’, whereas fossils dating from 12,000- 
11,000 years Bp found buried together with humans in Israel’ could rep- 
resent the earliest verified dog remains. Patterns of genomic variation 
indicate that dog domestication started at least 10,000 years BP** in south- 
ern East Asia‘ or the Middle East’. Dog domestication may however have 
been more complex, involving multiple source populations and/or back- 
crossing with wolves. 

It is unclear why and how dogs were domesticated. Humans may 
have captured wolf pups for use in guarding or hunting, resulting in 
selection for traits of importance for these new roles. Alternatively, as 
humans changed from a nomadic to sedentary lifestyle during the 
dawn of the agricultural revolution, wolves may themselves have been 
attracted to dumps near early human settlements to scavenge®. Natural 
selection for traits allowing for efficient use of this new resource may 
have led to the evolution of a variety of scavenger wolves that con- 
stituted the ancestors of modern dogs. Regardless of how dog domesti- 
cation started, several characteristics separating modern dogs from 
wolves, including reduced aggressiveness and altered social cognition 
capabilities’, suggest that behavioural changes were early targets of this 
process’. Dogs also differ morphologically from wolves, showing 
reduced skull, teeth and brain sizes°. Artificial selection for tameness 
in silver foxes indicates that selection on genetic variation in develop- 
mental genes may underlie both behavioural and morphological 
changes, potentially representing an important mechanism through- 
out animal domestication”®. 

At present, only a handful of genes separating wild from domestic 
forms have been identified in any domestic animals, including coat 


colour variants in MCIR in pig’ and a mutation in TSHR likely to affect 
seasonal reproduction in chicken", but to our knowledge in dogs no 
genome-wide sequence-based searches have been performed until 
now. To identify genomic regions under selection during dog domesti- 
cation we performed pooled whole-genome resequencing of dogs and 
wolves followed by functional characterization of candidate genes. 

Uniquely placed sequence reads from pooled DNA representing 
12 wolves of worldwide distribution and 60 dogs from 14 diverse breeds 
(Supplementary Table 1) covered 91.6% and 94.6%, respectively, of 
the 2,385 megabases (Mb) of autosomal sequence in the CanFam 2.0 
genome assembly". The aligned coverage depth was 29.8 for all dog 
pools combined and 6.2 for the single wolf pool (Supplemen- 
tary Table 1 and Supplementary Fig. 1). We identified 3,786,655 putat- 
ive single nucleotide polymorphisms (SNPs) in the combined dog and 
wolf data, 1,770,909 (46.8%) of which were only segregating in the dog 
pools, whereas 140,818 (3.7%) were private to wolves (Supplementary 
Table 2). Similarly we detected 506,148 short indels and 26,619 copy- 
number variations (CNVs) (Supplementary Files 1 and 2). We were 
able to experimentally validate 113 out of 114 tested SNPs (Sup- 
plementary Table 3 and Supplementary Discussion, section 1). 

To detect signals of strong recent selection we searched the dog 
genome for regions with reduced pooled heterozygosity (Hp)'° and/ 
or increased genetic distance to wolf (Fst). As evident from the skewed 
distribution of heterozygosity scores in dog relative to wolf (Fig. laand 
Supplementary Fig. 2), a major challenge to this approach is to sepa- 
rate true signals of selection from those caused by random fixation of 
large genomic regions during the formation of dog breeds'’. We alle- 
viate this problem by combining sequence data from all dog pools 
before selection analyses and require that detected signals span at least 
200 kilobases (kb; Methods and Supplementary Discussion, sections 2 
and 3). Given the complex and partly unknown demographic history 
of dogs, it is furthermore difficult to assign strict thresholds that dis- 
tinguish selection and drift. We propose that the best way to validate 
regions detected here is to study genetic data from additional indivi- 
duals and provide evidence for functional change associated with 
putatively selected regions. Eventually, indications that similar path- 
ways changed during independent domestication events may provide 
conclusive evidence for selection. Here we Z-transform the autosomal 
Hp (Z(Hp)) and Fsr (Z(Fsr)) distributions (see Supplementary Dis- 
cussion, section 4 for an analysis of the X chromosome) and focus 
our description of putatively selected regions to those that fall at least 
five standard deviations away from the mean (Z(Hp)<—5 and 
Z(Fsr) > 5), as these represent the extreme ends of the distributions. 
By applying these thresholds we identified 14 regions in the dog 
genome with extremely low levels of heterozygosity (average length = 
400 kb, average Hp = 0.036 (range 0.015-0.056), average autosomal 
Hp = 0.331) (Fig. 1c and Supplementary Table 4) and 35 regions 
with strongly elevated Fg; values (average length = 340 kb, average 
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Figure 1 | Selection analyses identified 36 candidate domestication regions. 
a, Distribution of Z-transformed average pooled heterozygosity in dog 
(Z(Hp)poag) and wolf (Z(Hp)wotr) respectively, as well as average fixation 
index (Z(Fs)), for autosomal 200 kb windows (a, standard deviation; 1, 
average). b, The positive end of the Z(Fsr) distribution plotted along dog 


Fsy = 0.734 (range 0.654-0.903), average autosomal Fs; = 0.223) 
(Fig. 1b and Supplementary Table 5). All Fs7 regions are characterized 
by low levels of heterozygosity in either dog or wolf (although all do not 
pass the Z(Hp)<—5 threshold), indicating that the two statistics 
detect the same events (Methods and Supplementary Discussion, 
sections 2 and 3). In total, 36 unique autosomal candidate domestica- 
tion regions (CDRs) containing 122 genes were identified by the 
two approaches combined (Supplementary Table 6 and Fig. 1b, c). 
None of these regions overlaps those of a previous genotype-based 
study’ (Supplementary Discussion, section 3), stressing the im- 
portance of identifying domestication regions directly by sequencing 
or by comprehensively ascertaining SNPs in wild ancestors before 
genotyping. 

We searched for significantly overrepresented gene ontology terms 
among genes in autosomal CDRs and identified 25 categories, repre- 
senting several groups of interrelated terms (Table 1 and Supplemen- 
tary Table 7), none of which was indicated in a separate analysis of 
selection in wolf (Supplementary Discussion, section 8). The most 
conspicuous cluster (11 terms) relates to the term ‘nervous system 
development’. The eight genes belonging to this category (Supplemen- 
tary Tables 7 and 8) include MBP, VWC2, SMO, TLX3, CYFIP1 and 
SH3GL2, of which several affect developmental signalling and synaptic 
strength and plasticity'**°. We surveyed published literature and iden- 
tified 11 additional CDR genes with central nervous system function 
(Supplementary Table 9), adding to a total of 19 CDRs that contain 
brain genes. These findings support the hypothesis that selection for 
altered behaviour was important during dog domestication and that 
mutations affecting developmental genes may underlie these changes’. 

The gene ontology analysis also pinpoints two genes involved in the 
binding of sperm and egg: ZPBP encodes the zona pellucida binding 
protein that mediates binding of sperm to the zona pellucida glycopro- 
tein layer (ZP) of the egg, and ZP2 codes for one of the proteins that 
make up ZP itself. In addition, a CDR on chromosome 6 encompasses 
PDILT that also affects binding of sperm to ZP”, altogether indicating 
that sperm competition may have been an important evolutionary 
force during dog domestication”’. 
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autosomes 1-38 (chromosomes are separated by colour). A dashed horizontal 
line indicates the cut-off (Z > 5) used for extracting outliers. c, The negative end 
of the Z(Hp) distribution plotted along dog autosomes 1-38. A dashed 
horizontal line indicates the cut-off (Z << —5) used for extracting outliers. 


Overrepresented terms ‘starch metabolic process’, ‘digestion’ and 
‘fatty acid metabolism’ include genes involved in starch digestion 
(MGAM) and glucose uptake (SGLT1), as well as a candidate gene for 


Table 1 | Enriched gene ontology terms among CDR genes 


Gene ontology term Pepp Value Gene count 
Regulation of neuron differentiation 0.005 3 (26) 
Multicellular organismal process 0.005 21 (3,822) 


Neuron differentiation 0.010 5 (210) 
Regulation of molecular function 0.011 8 (671) 
Central nervous system development 0.013 5 (235) 
Regulation of developmental process 0.013 5 (236) 
Generation of neurons 0.013 5 (242) 


Nervous system development 0.013 8 (716) 


Neurogenesis 0.015 5 (262) 


7 (605) 
5 (307) 


0.020 
0.026 


Regulation of catalytic activity 
Regulation of hydrolase activity 


System development 0.034 11 (1,605) 
Regulation of GTPase activity 0.039 4 (211) 
Anatomical structure development 0.039 12 (2,005) 
Intramembranous ossification 0.039 1 (1) 
Quinolinate metabolic process 0.039 1 (1) 


0.039 
0.039 


1 (1) 
9 (1,242) 


Glucocorticoid catabolic process 
Cell development 
Enriched terms are colour-coded to reflect relatedness in the ontology or functional proximity. Blue, 
nervous system development; green, sperm-egg recognition; grey, regulation of molecular function; 


orange, digestion. For each term, gene count shows number of genes in CDRs relative to total number of 
annotated genes (in parentheses). 
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Hp (blue), and average fixation index, Fs (orange), plotted for 200-kb windows 
across a chromosome 6 region harbouring AMY2B. b, Heterozygosity, H (blue), 
and fixation index, Fs; (orange), for single SNPs in the selected region. Dog 
relative to wolf coverage, rC (green line), indicates increase in AMY2B copy 


insulin resistance (ACSM24A) that initiates the fatty acid metabolism”. 
A total of 6 CDRs harbour 10 genes with functions related to starch 
and fat metabolism (Supplementary Table 10). We propose that gene- 
tic variants within these genes may have been selected to aid adapta- 
tion from a mainly carnivorous diet to a more starch rich diet during 
dog domestication. 

The breakdown of starch in dogs proceeds in three stages: (1) starch 
is first cleaved to maltose and other oligosaccharides by alpha-amylase 
in the intestine; (2) the oligosaccharides are subsequently hydrolysed 
by maltase-glucoamylase”’, sucrase and isomaltase to form glucose; 
and (3) finally, glucose is transported across the plasma membrane 
by brush border protein SGLT1*". Here we present evidence for selec- 
tion on all three stages of starch digestion during dog domestication. 

Whereas humans have acquired amylase activity in the saliva” via 
an ancient duplication of the pancreatic amylase gene, dogs only 
express amylase in the pancreas”. In dogs the AMY2B gene, encoding 
the alpha-2B-amylase, resides in a 600-kb CDR on chromosome 6 with 
Z(Hp) and Z(Fsr) scores of — 4.60 and 7.16, respectively (Figs 1 and 2a). 
Interestingly, an 8-kb sequence spanning the AMY2B locus showed 
a several-fold increase in aligned read depth in dog relative to wolf 
(Fig. 2b), suggestive of a copy number change. Formal comparisons of 
regional and local pool coverage, and wolf and dog coverage (Methods), 
respectively, also suggest a substantial increase in copy numbers in all 
dog pools compared to wolf at this locus (Supplementary Discussion, 
section 5). 

We confirmed this CNV by quantifying AMY2B copy numbers in 
136 dogs and 35 wolves (Supplementary Table 11) using real-time 
quantitative PCR (qPCR). Whereas all wolves tested carried only 
2 copies (2N = 2), diploid copy numbers in dog ranged from 4 to 30 
(P<0.001, Wilcoxon) (Fig. 2c), corresponding to a remarkable 
7.A4-fold average increase in dog AMY2B copy numbers. To assess 
whether this change correspond to a difference in amylase activity, 
we first compared AMY2B gene expression in pancreas from dog 
(n = 9) and wolf (n = 12) and noted a 28-fold higher average expres- 
sion in dog (P<0.001, Wilcoxon, Fig. 2d). We then quantified 
amylase activity in frozen serum (Fig. 2e) and found a 4.7-fold 
higher activity in dog (9.6-18.4 ukatl (n= 12)) relative to wolf 
(1.4-4.3 pikatl~! (n = 13)) (P< 0.001, Wilcoxon). Similar results were 
obtained in comparisons of a limited number of fresh samples 
(Supplementary Tables 12 and 13). The change in AMY2B gene copy 
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number in dog. Genes in the region are shown below panel b. c, Histogram 
showing the distribution of diploid amylase copy number in wolf (n = 35) 
(blue) and dog (n = 136) (red). d, Amylase messenger RNA expression levels in 
pancreas of wolf (n = 12) and dog (n = 9). e, Amylase activity in serum from 
wolf (1 = 13) and dog (n = 12). 


number together with a correlated increase in both expression 
level (rho = 0.84, P< 0.0001, Spearman) (Supplementary Fig. 3) and 
enzyme activity (rho = 0.63, P<0.01, Spearman) (Supplementary 
Fig. 4) indicates that duplications of the alpha-amylase locus conferred 
a selective advantage to early dogs by causing an increase in amylase 
activity. 

Maltase-glucoamylase is responsible for the second step in the 
breakdown of starch, catalysing the hydrolysis of maltose to glucose”. 
No copy number changes were observed in the MGAM locus so we 
decided to study haplotype diversity across the region to facilitate the 
identification of causal variants. We genotyped 47 randomly selected 
SNPs in 71 dogs representing 38 diverse breeds and 19 wolves of 
worldwide distribution (referred to as ‘the reference panel’, Sup- 
plementary Table 14). Sixty-eight of the seventy-one dogs tested car- 
ried at least one copy of a 124-kb long haplotype spanning the entire 
MGAM anda small neighbouring locus encoding the bitter taste medi- 
ating taste receptor 2 member 38 (TASR38) (Fig. 3a—c). Whereas none 
of the wolves carried the selected haplotype, 55 dogs were homozygous 
for it, 13 were heterozygous and only three dogs lacked it (2 West 
Highland White Terriers and 1 Chinese Crested Dog). This high 
degree of haplotype differentiation between dog and wolf (average 
Fsy for genotyped SNPs = 0.75) indicates that this haplotype may 
harbour genetic variation of selective advantage to dogs (Supplemen- 
tary Discussion, sections 3 and 6). 

We identified several candidate mutations within MGAM that may 
have been targeted by selection in this region (Supplementary Table 15). 
First a conservative amino acid substitution located in the duplicated 
trefoil domain of MGAM (residue 1001) is nearly fixed for isoleucine in 
wolf and for valine in dogs. Eleven out of fourteen mammals have valine 
at this position, whereas the omnivorous rat, and the insectivorous 
hedgehog and short-tailed opossum, carry isoleucine like the wolf 
(Supplementary Table 16). Second, another conservative substitution, 
methionine to valine, located in the beta sheet of the maltase enzyme 
(residue 797), is segregating in wolf but fixed for methionine in dog. The 
insectivorous hedgehog and common shrew are the only mammals 
without methionine at this evolutionarily conserved position (Sup- 
plementary Table 17) and in silico modelling using the SDM-server 
indicates that a change from methionine to valine at this residue is 
destabilizing’. Third, a fixed two-base-pair deletion in dog disrupts 
the stop codon, thereby extending the carboxy-terminal end of dog 
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Figure 3 | Selection is associated with increased maltase activity. a, Pooled 
heterozygosity, Hp (blue), and average fixation index, Fs; (orange), plotted for 
200-kb windows across a chromosome 16 region harbouring MGAM. 

b, Heterozygosity, H (blue), and fixation index, Fsy (orange), for single SNPs in 
the selected region. c, Haplotypes inferred from genotyping of 47 SNPs across 
the MGAM locus in 71 dogs and 19 wolves (red and blue colour are major and 
minor dog allele, respectively). Genes in the genotyped region are shown below 
panel c. d, MGAM mRNA expression levels in pancreas of wolf (n = 8) and dog 
(n = 9). e, MGAM activity in serum from wolf (n = 8) and dog (n = 7). 


MGAM by two amino acids: asparagine and phenylalanine. In 32 mam- 
mals studied only herbivores (rabbit, pika, alpaca and cow) and omni- 
vores (mouse lemur and rat) share an extension like that seen in dog 
(Supplementary Table 18). A fourth candidate mutation in intron 37 
affects a predicted binding site for the glucose metabolism regulator 
NR4A2 protein” by shifting the wolf sequence away from the canonical 
NR4A2-binding motif. Three out of four mammals with the wolf allele 
at this site rely heavily on insects or fish for their nutritional require- 
ments (Supplementary Table 19). 

To decipher whether the candidate mutations act primarily on expres- 
sion or protein activity we examined MGAM expression in pancreas 
and the resulting enzymatic activity in serum. Dogs showed a ~12-fold 
higher expression (P < 0.001, Wilcoxon, npog = 9; Nwotr = 8) (Fig. 3d) 
and a ~twofold increase in maltose to glucose turnover compared to 
wolves (average glucose produced in dogs: 0.94 AAszonm (0.64-1.23, 
n=7) and wolves: 0.52 AAszonm (0.44-0.66, n =8), P=0.0012, 
Wilcoxon) (Fig. 3e). Although we cannot rule out that diet-induced 
plasticity contributed to this difference”, our results indicate that the 
mutation affecting a NR4A2-binding site or another unknown variant 
probably affect the expression of MGAM. Selection may thus clearly have 
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led to increased MGAM expression, but we cannot rule out that the 
strong selection affecting this locus may have favoured the accumulation 
of protein-coding changes on the same haplotype. Similar scenarios have 
been seen for white coat colour in dogs and pigs, where repeated selection 
for additional mutations has resulted in an allelic series of white spotting 
at the MITF and KIT loci, respectively”. 

Once starch has been digested to glucose it is absorbed through 
the luminal plasma membrane of the small intestine by the sodium/ 
glucose cotransporter 1 (SGLT1)*’. To benefit from an increased 
capacity to digest starch, dogs would therefore be expected to show a 
parallel increase in glucose uptake. A CDR on chromosome 26 (Sup- 
plementary Fig. 5a, b) encompasses SGLT1 and a gene (SGLT3) encod- 
ing the glucose-sensing sodium/glucose cotransporter 3 protein”. To 
characterize the haplotype diversity we genotyped 48 randomly chosen 
SNPs across this CDR in the reference panel and identified a 50.5-kb 
region, spanning the 3’ section of SGLT1 as well as the 3’end of SGLT3, 
that is highly divergent between dog and wolf (Supplementary Fig. 5c). 
In this region all dogs tested were carriers of a particular haplotype, 
for which 63 were homozygous and eight heterozygous. This con- 
trasts to 19 wolves where a single individual carried one copy of the 
haplotype. Based on the high haplotype differentiation (average Fgr 
for 18 SNPs in 50.5-kb haplotype = 0.81) it is likely that SGLT1 and 
its 3’ region represents an additional dog domestication locus. 

The 50.5-kb region includes a conservative isoleucine to valine 
substitution in SGLT1 (residue 244) that affects a loop facing the 
extracellular side of the luminal membrane (Supplementary Table 15). 
Heterologous expression analysis” shows that glycosylation at a nearby 
site (residue 248) affects glucose transport, indicating that it is possible 
that dogs acquired improved glucose uptake as a result of the observed 
substitution. In addition, we see only non-significant differences in SGLT1 
expression in pancreas of dog (n =9) and wolf (n= 4) (P= 0.39, 
Wilcoxon) (Supplementary Fig. 6), indicating that selection primarily 
targeted a structural rather than regulatory mutation in SGLT1. 

In conclusion, we have presented evidence that dog domestication 
was accompanied by selection at three genes with key roles in starch 
digestion: AMY2B, MGAM and SGLT1. Our results show that adapta- 
tions that allowed the early ancestors of modern dogs to thrive ona diet 
rich in starch, relative to the carnivorous diet of wolves, constituted a 
crucial step in early dog domestication. This may suggest that a change 
of ecological niche could have been the driving force behind the 
domestication process, and that scavenging in waste dumps near the 
increasingly common human settlements during the dawn of the agri- 
cultural revolution may have constituted this new niche’®. In light of 
previous results describing the timing and location of dog domestica- 
tion, our findings may suggest that the development of agriculture 
catalysed the domestication of dogs. 

The results presented here demonstrate a striking case of parallel 
evolution whereby the benefits of coping with an increasingly starch- 
rich diet during the agricultural revolution caused similar adaptive 
responses in dog and human”. This emphasizes how insights from 
dog domestication may benefit our understanding of human recent 
evolution and disease. Finally, by understanding the genetic basis of 
adaptive traits in dogs we have come closer to unlocking the potential 
in dog and wolf comparisons to decipher the genetics of behaviour. 


METHODS SUMMARY 

Sequencing. We pooled genomic DNA from 12 individuals before mate-pair 
library construction and sequencing on the AB SOLiD system, version 3, accord- 
ing to standard manufacturer protocols. Sequencing reads were aligned to the 
CanFam 2.0 reference sequence using the Bioscope 1.1 software. 

Selection analyses. We identified variable sites in data combined from all pools 
and required a minimum of three reads supporting an alternative allele to call a 
SNP. We used allele counts at variable sites to identify signals of selection in 200-kb 
windows using two approaches: for each window we calculated (1) the average 
pooled heterozygosity, Hp (ref. 10), and (2) the average fixation index, Fer, 
between dog and wolf. Putatively selected regions were located by extracting 
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windows from the extreme tails of the Z-transformed Hp and Fey distributions by 
applying a threshold of 5 standard deviations. 

Functional assays. We used multiplex TaqMan assays and SYBR Green real-time 
PCR to quantify CNVs and gene expression, respectively. Serum amylase activity 
was analysed using an Architect e400 instrument and serum maltase activity was 
quantified based on the amount of maltose to glucose turnover. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


DNA extraction. DNA was extracted from tissue using Qiagen tissue DNA 
extraction kits or from EDTA blood using either manual salt precipitation or 
the QIASymphony DNA Midi kit (Qiagen) on the QIASymphony robot (Qiagen). 
Sequencing. We pooled DNA from 12 individuals per pool before mate-pair 
library construction and sequencing on the AB SOLiD system, version 3, accord- 
ing to standard manufacturer protocols (Applied Biosystems). Sequencing reads 
were aligned to the CanFam 2.0 reference sequence using the Bioscope 1.1 soft- 
ware. We removed duplicated (http://picard.sourceforge.net) and poorly mapped 
reads (mapping quality <20 in Samtools)*' and retained only uniquely mapped 
reads for further analyses. 

SNP detection. We searched for variable sites in data combined from all pools 
(including wolf) to increase sensitivity to rare alleles. We required a minimum of 
three reads supporting an alternative allele to call a SNP, and applied a further 
filtering step implemented in samtools.pl varFilter (settings: -Q25 -q10 -d3 
-D120 -G25 -w10 -N2 -130) to ensure a high call accuracy that is largely unaffected 
by, for example, paralogous sequence variants. We called genotypes for all SNPs in 
all dog pools and the single wolf pool by counting sequencing reads supporting the 
reference and variant allele, given a minimum base quality of 20, to estimate allele 
frequencies in the dog and wolf populations. A random selection representing 
25% of the sequencing reads from pools 4 and 5 were included in this process to 
achieve unbiased allele frequency estimates. 

Selection analyses. Allele counts and allele frequencies at all identified variable 
sites were used to search the dog genome for regions that may have been affected 
by selection during the early phase of dog domestication using two complementary 
approaches. First we calculated the average pooled heterozygosity (Hp) in 200-kb 
windows sliding 100 kb at a time, for all five dog pools combined, and in the single 
wolf pool separately, following the methodology described in ref. 10. Briefly, this 
method sums all minor and major allele counts, respectively, at all variable sites 
within a window, and estimates the heterozygosity based on the combined allele 
counts for the entire window. The advantage of this method over calculating a 
simple arithmetic mean of all single-site heterozygosity estimates is that it accounts 
for variable sequence coverage across the window. To avoid spurious selection 
signals we discarded 49 out of 21,927 windows containing fewer than 10 inform- 
ative sites from both this and the subsequent Fs; analysis. We Z-transformed the 
resultant distribution of Hp scores and extracted putatively selected windows in 
the extreme tail of the distribution by applying a Z(Hp) < —5 cut-off. 

Second we calculated Fsr values between dog and wolf for individual SNPs 
using a method that adjusts for sample size differences’. We averaged Fey values 
across 200-kb windows, sliding 100 kb at a time and Z-transformed the resultant 
distribution. Putative selection targets were extracted from the extreme tail of the 
distribution by applying a Z(Fsr) > 5 cut-off, and attributed to selection in dog if 
the corresponding Z(Hp)pog < Z(Hp)wo sr: to selection in wolf if Z’Hp)wotr < 
Z(Hp)poc (three regions), and to selection in both taxa if Z(Hp)worr < —4 and 
Z(Hp) wor < —4. 

Gene ontology analysis. We used the Ensembl gene annotations to identify genes 
residing within regions extending 100 kb up- and downstream of CDRs to include 
potential effects of regulatory changes on loci at some distance, and to reduce the 
risk of excluding the outermost portions of the selected haplotypes by using sliding 
windows of fixed size. We tested for enrichment of gene ontology terms (GOa- 
human) assigned to the subset of these CDR genes for which human orthology 
could be established (79 out of 122) using the GOstat program”. 

Genotyping validation. We designed an iPLEX assay targeting 124 SNPs located 
in CDRs showing a high degree of homozygosity or population differentiation. A 
total of 71 dogs, representing 38 different breeds, and 19 wolves (Supplementary 
Table 14) were genotyped using standard protocols provided by the manufacturer 
(Sequenome). Haplotypes were phased using fastPHASE™. 

qPCR CNV detection. We quantified DNA copy number variation using Multi- 
plex TaqMan assays containing primers and probes (Supplementary Table 20) 
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matching both the target and reference sequence (housekeeping gene C7orf28b) 
according to the manufacturer’s protocol. All reactions were run in triplicate and 
data was analysed using the CopyCaller software (Applied Biosystems). Copy 
numbers for each target were normalized to the same wolf to account for inter- 
plate variability. 

qPCR expression analyses. Pancreatic tissue samples from dogs and wolves 
where collected post mortem, stored in RNAlater at 4 °C for 24h and subsequently 
freeze-stored at —80°C. We used TRIzol to isolate RNA from these samples, 
followed by complementary DNA synthesis using the Advantage RT for PCR 
kit according to the manufacturers’ protocols (Life Technologies and Clontech, 
respectively). We designed exonic primers (Supplementary Table 21) and quan- 
tified the amount of cDNA using SYBR Green real-time PCR (Applied bio- 
sciences) on a 7900HT Fast real time PCR system (Applied Biosystems) and 
analysed the data using the qbasePLUS (Biogazelle) software according to the 
AAC, method. All reactions where run in triplicate and normalized by compari- 
sons to housekeeping genes RPL32 and RPL13A. 

Amylase activity. Peripheral EDTA and serum blood samples where collected 
from dogs and both captive and free-ranging wolves. Serum amylase activity was 
analysed at the Clinical Pathology service (Swedish Agricultural University) using 
an Architect e400 instrument (Abott Laboratories), except for 8 serum samples 
(Supplementary Table 13) which were run on a VetScan instrument (Abaxis). 
Maltase activity. Maltase activity was assayed according to the principle outlined 
in ref. 35, whereby a known amount of maltose substrate is added to serum and 
the resultant glucose produced is measured as the change in absorbance after 
five minutes (AAs7onm). We used reagents from the ab83388 Maltose assay kit 
(Abcam) and serum sampled as described above. For each individual, glu- 
cose residuals were measured in duplicate and maltase assays were performed 
in triplicate. 

Indel calling. We used Bioscope 1.1 to call small insertions and deletions in each 
pool separately. We then combined the results of all pools and extracted a set of 
high confident indels by requiring that indels were supported by at least three 
sequencing reads. 

CNV detection. Four methods were used to detect structural variation in the dog 
genome. We searched for deviations in insert size using the large indel tool imple- 
mented in Bioscopel.1. We compared the coverage depth between the pooled 
samples using CNVseq” and the Fixed deletions method”? and finally identified 
regions in which the coverage depth deviated from the pool average using 
CNVnator’’. Methods relying on comparisons of sequence coverage between 
pools always used the wolf as reference pool. 

Ethics. All animals contributing tissue samples to this study died for other reasons 
than participating in this study. All dog samples were taken with the owners 
consent. The sampling conformed to the decision of the Swedish Animal 
Ethical Committee (no. C62/10) and the Swedish Animal Welfare Agency 
(no.31-1711/10). 
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To maintain lifelong production of blood cells, haematopoietic 
stem cells (HSCs) are tightly regulated by inherent programs and 
extrinsic regulatory signals received from their microenvironmen- 
tal niche. Long-term repopulating HSCs reside in several, perhaps 
overlapping, niches that produce regulatory molecules and signals 
necessary for homeostasis and for increased output after stress or 
injury’ °. Despite considerable advances in the specific cellular or 
molecular mechanisms governing HSC-niche interactions, little is 
known about the regulatory function in the intact mammalian 
haematopoietic niche. Recently, we and others described a positive 
regulatory role for prostaglandin E, (PGE,) on HSC function ex 
vivo®’. Here we show that inhibition of endogenous PGE, by non- 
steroidal anti-inflammatory drug (NSAID) treatment in mice 
results in modest HSC egress from the bone marrow. Surpri- 
singly, this was independent of the SDF-1-CXCR4 axis implicated 
in stem-cell migration. Stem and progenitor cells were found to 
have differing mechanisms of egress, with HSC transit to the peri- 
phery dependent on niche attenuation and reduction in the reten- 
tive molecule osteopontin. Haematopoietic grafts mobilized with 
NSAIDs had superior repopulating ability and long-term engraft- 
ment. Treatment of non-human primates and healthy human 
volunteers confirmed NSAID-mediated egress in other species. 
PGE, receptor knockout mice demonstrated that progenitor 
expansion and stem/progenitor egress resulted from reduced 
E-prostanoid 4 (EP4) receptor signalling. These results not only 
uncover unique regulatory roles for EP4 signalling in HSC reten- 
tion in the niche, but also define a rapidly translatable strategy to 
enhance transplantation therapeutically. 

Mice were treated with the prototypical NSAID indomethacin 
(Supplementary Fig. 1a) to reduce endogenous PGE, production, result- 
ing ina significant increase in haematopoietic progenitor cells (HPCs) in 
the peripheral blood that was not accompanied by an increase in the 
number of white blood cells (Supplementary Fig. 1b, c). This HPC 
increase but unaltered white blood cell count probably accounts for 
the lack of previous detection of this observation, despite decades of 
clinical NSAID use. No increase in HPC egress was seen in mice treated 
with the lipoxygenase inhibitor baicalein, suggesting a cyclooxygenase 
(COX) pathway-specific effect. Co-administration of indomethacin 
with the clinically used mobilizing agent granulocyte colony-stimulating 
factor (G-CSF) significantly enhanced (~twofold) HPC mobilization 
(Supplementary Fig. 1b). NSAIDs with varying degrees of COX-1 and 
COX-2 selectivity demonstrated significant mobilization with indo- 
methacin, aspirin, ibuprofen and meloxicam (Supplementary Fig. 2). 
Meloxicam inhibits both COX-1 and COX-2 in the bone marrow micro- 
environment (Supplementary Fig. 3), and when compared to other dual 


inhibitors it has a reduced incidence of gastrointestinal discomfort* and 
inhibition of platelet aggregation’. Therefore, meloxicam was used in 
most of the studies. We did not extensively test the differential roles of 
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Figure 1 | NSAIDs mobilize haematopoietic stem and progenitor cells. 


a, b, Meloxicam enhances mobilization of HPCs (a) and HSCs (b) into blood 
(n = 4-5 mice per group per experiment; three experiments). BFU-E, erythroid 
progenitors (burst-forming unit-erythrocyte); CFC, colony-forming cells; 
CFU-GEMM, multipotential progenitor cells (colony-forming unit- 
granulocyte, erythrocyte, macrophage, megakaryocyte); M, meloxicam. 

c, Chimaerism. PBMCs, peripheral blood mononuclear cells; WBM, whole 
bone marrow cells. d, e, Competitive repopulating units (CRU) (d), and LT- 
HSC frequency (Poisson distribution) (e), 36 weeks after limiting dilution 
competitive transplants of PBMCs from mice treated with G-CSF and 
combination regimens (n = 8 mice per group, assayed individually). Mice were 
treated with G-CSF or a staggered regimen of G-CSF plus meloxicam, and 
PBMCs were transplanted into lethally irradiated mice. f, g, Neutrophil (f) and 
platelet (g) recovery were monitored for 90 days. *P < 0.05, **P < 0.01, 

***P < 0.001; unpaired two-tailed t-test. All data are mean + s.e.m. 
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Figure 2 | Non-human primates and healthy human volunteers mobilize 
HSCs and HPCs in response to NSAID treatment. a-—e, Four baboons were 
treated with G-CSF with/without meloxicam in a crossover design (shown in 
a). b, c, CD34" cells (b) and CFU-GM (c) in peripheral blood were determined. 
d, e, CD34" cells (d) and CFU-GM (e) in peripheral blood were determined 
before (pre) and after (post) five days of meloxicam treatment. f-i, Seven 
healthy human volunteers were treated with 15 mg meloxicam a day orally for 
five days, and assessed for CD34" cells (f), CFU-GM (g), BFU-E (h) and CFU- 
GEMM (i), before and after treatment. Statistics represent paired, two-tailed 
t-test; NS, not significant. 


COX-1 and COX-2 and, therefore, there may be similar activity of 
NSAIDs with different COX-1 or COX-2 inhibitory profiles when com- 
pared to meloxicam. 

Meloxicam, similar to indomethacin, increased the egress of HPCs 
(Fig. la and Supplementary Fig. 4) and the phenotypic HSC-enriched 
populations Sca-1* c-kit Lin” (SKL) or the highly purified CD1507 
CD48” (SLAM) SKL populations (Fig. 1b and Supplementary Fig. 4). 
Enhanced cell egress was maintained in Alox5 (also known as 5LX and 
5-ALOX) knockout mice (Supplementary Fig. 5), further demonstra- 
ting that the effects are not due to general eicosanoid inhibition. 
Enhancement in cell egress was also not specific to G-CSF, as meloxi- 
cam enhanced the cell mobilization caused by the clinically used 
CXCR4 antagonist AMD3100 (Supplementary Fig. 6). 

Despite significant increases in phenotypic HSCs and functional 
HPCs in the peripheral blood, two early transplant attempts did not 
show enhanced HSC engraftment (Supplementary Fig. 7a, b). Because 
we previously showed that PGE; signalling was a positive regulator of 
HSC CXCR4 expression and homing to the niche’, we proposed that 
although the HSC and HPC yield was increased in NSAID grafts, 
CXCR4 expression might be reduced, accounting for the apparent lack 
of enhanced engraftment. To test this hypothesis we staggered the 
administration of NSAID and G-CSF to allow for haematopoietic 
mobilization and restoration of normal endogenous PGE, signalling 
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Figure 3 | PGE, EP4 receptor antagonism/knockout expands bone marrow 
HPCs and enhances mobilization. a, HPC mobilization with G-CSF, G-CSF 
plus meloxicam, G-CSF plus E-prostanoid (EP) receptor antagonists, or 
G-CSF, meloxicam and E-prostanoid receptor agonists (n = 5 mice per group, 
assayed individually). b-d, The EP4 antagonist L-161,982 enhanced HSC 
mobilization (n = 4 mice per group, assayed individually) (b), and long-term 
reconstitution 16 weeks after transplantation (c), with multi-lineage 
reconstitution (n = 5 mice per group, assayed individually) (d). WBC, white 
blood cells. e, f£, EP4 knockout mobilization. Meloxicam enhances mobilization 
of HPCs (e) and SLAM SKL cells (f) in wild-type littermates (Ep4" if but not in 
EP4 conditional knockouts (EP4"*" KO) (n = 3-4 mice per group, assayed 
individually). *P < 0.05, **P < 0.01, ***P < 0.001; unpaired two-tailed t-test. 
+P <0.05 compared to G-CSF plus meloxicam. All data are mean + s.e.m. 


before transplantation (Supplementary Fig. 7c). CXCR4 levels were 
significantly lower after NSAID treatment, and staggered administra- 
tion allowed for restored receptor levels, while maintaining enhanced 
HSC egress (Supplementary Fig. 7d, e). We competitively transplanted 
mobilized grafts from G-CSF, or non-staggered and staggered G-CSF 
plus meloxicam-treated mice. Staggered administration resulted in 
significant enhancement of long-term repopulating HSC (LT-HSC) 
engraftment, with a 48-h stagger resulting in a 2.6-fold increase in LT- 
HSCs (Fig. 1c-e and Supplementary Fig. 8). When grafts were trans- 
planted non-competitively, staggered co-administration of meloxicam 
resulted in four-day faster recovery of neutrophils (Fig. 1f) and plate- 
lets (Fig. 1g) compared with G-CSF alone. Secondary transplantation 
confirmed sustained LT-HSC activity with multi-lineage reconstitu- 
tion 36 weeks after transplant (Supplementary Fig. 9). 

To confirm NSAID-mediated HSC and HPC egress in other species, 
four baboons were treated with a standard regimen of G-CSF, or with 
the combination of G-CSF plus meloxicam in a crossover design 
(Fig. 2a). Although the responses of individual baboons to G-CSF 
varied, in all cases meloxicam treatment increased CD34* cells 
(Fig. 2b) and granulocytic-monocytic progenitors (colony-forming 
unit-granulocyte, macrophage; CFU-GM) (Fig. 2c) in peripheral 
blood. Meloxicam treatment on its own also resulted in significant 
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HSC and HPC egress (Fig. 2d, e). In healthy human volunteers, melox- 
icam treatment resulted in significant increases in CD34* cells 
(Fig. 2f), and functionally defined HPCs (Fig. 2g-i), matching the 
HSC and HPC egress seen with meloxicam treatment in baboons 
and mice. Thus, short-term endogenous PGE, inhibition, closely 
resembling current clinical NSAID treatment, results in a previously 
unappreciated increase in HSC and HPC mobilization. 

Meloxicam treatment increased functionally defined myeloid pro- 
genitors and phenotypically defined granulocyte-macrophage pro- 
genitors in the bone marrow, but no differences in phenotypically or 
functionally defined HSC were observed (Supplementary Fig. 10). 
Because PGE, signals through four receptors (EP1-4), each with 
unique signalling pathways’°, we proposed that the myeloid expansion 
and egress was due to lack of signalling via one or more of the 
E-prostanoid receptors. Only agonists capable of activating the EP4 
receptor inhibited myeloid HPCs (Supplementary Fig. 11a). To con- 
firm further the specific role of the EP4 receptor, similar assays were 
performed using knockout mice for each of the E-prostanoid recep- 
tors. Comparison of all knockout strains showed that only HPCs from 
conditional EP4~'~ (also known as Ptger4'~) mice had reduced 
responses to inhibition by PGE, (Supplementary Fig. 11b), and a 
2.3-fold increase in marrow macrophage progenitors (colony-forming 
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unit-macrophage; CFU-M) compared to wild type (Supplementary 
Fig. 11c). Co-administration of EP4 antagonists with G-CSF signifi- 
cantly enhanced mobilization, similar to meloxicam, whereas EP1, 
EP2 and EP3 antagonists failed to increase mobilization (Fig. 3a). 
Furthermore, when a selective EP4 agonist was co-administered with 
G-CSF plus meloxicam, the meloxicam enhancement of mobilization 
was abrogated, and to the same degree as co-administration of the 
long-acting PGE, analogue 16,16-dimethyl-PGE, (dmPGE;). Ago- 
nists that did not target the EP4 receptor failed to alter meloxicam 
enhancement. EP4 antagonism with G-CSF enhanced mobilization of 
LT-HSCs (Fig. 3b-d), indicating that the NSAID-mediated effects on 
HPC/HSC egress are due to reduced EP4 receptor signalling. Con- 
sistent with pharmacological data, conditional EP4 deletion increased 
HPC and HSC egress (Supplementary Fig. 11d-f), and enhanced 
mobilization caused by meloxicam was abrogated (Fig. 3e, f). These 
data implicate PGE, and EP4 receptor signalling in mediating the cell 
egress effects of NSAIDs; however, because we did not conduct a 
comprehensive lipidomic profile we cannot exclude contributions of 
other eicosanoids. 

In vitro and in vivo results indicate that a lack of EP4 signalling 
drives HPC expansion, possibly explaining one mechanism responsi- 
ble for the enhanced HPC egress: more marrow HPCs allows more to 
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Figure 4 | NSAIDs attenuate haematopoietic supportive molecules and 
differentially mobilize HSCs and HPCs in Opn knockout and EP4 
conditional knockout mice. a—d, Assessment of Col2.3-GFP cells after vehicle 
(a) or meloxicam (b) demonstrates reduced percentages (c) and numbers (d) of 
osteolineage cells (expressed as the percentage/number of osteoblasts (ob) per bone 
surface) (n = 4 mice per group, assayed individually). e, Immunohistochemical 
staining of haematopoietic supportive molecules after treatment with meloxicam 
(original magnification, x 400). Arrows denote osteolineage cell layer lining the 


endosteum. f-h, Meloxicam enhances mobilization of HPCs in Opn’ ~ mice 


(f), with no enhancement in long-term reconstitution 16 weeks after 
transplantation (g, h). i, Representation of chimaera generation allowing 
conditional knockout of donor haematopoietic cells, or recipient stromal cells. EP4 
was deleted with tamoxifen 8 weeks after transplantation and mice were treated 
with G-CSF or G-CSF plus meloxicam. j, k, Enhanced mobilization of HPCs by 
meloxicam when EP4 is expressed on haematopoietic cells (j) and enhanced 
mobilization of HSCs when EP4 is expressed by stromal cells (kk) (n = 4 mice per 
group, assayed individually). WT, wild type. *P < 0.05, **P < 0.01, ***P < 0.001; 
unpaired two-tailed t-test. Data are mean + s.e.m. 
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be mobilized to the periphery. However, no alterations in bone mar- 
row HSC content were observed (Supplementary Fig. 10), suggesting 
that HSC mobilization results from a different mechanism, perhaps 
acting on the HSC niche. Gross histological analysis of NSAID-treated 
mice over 0-4 days showed a progressive increase in the laminarity of 
osteolineage cells lining the endosteum (Supplementary Figs 12 and 
13), similar to that seen after G-CSF treatment''. Comparable results 
were observed in collagen 2.3-GFP reporter mice (which express the 
enhanced green fluorescent protein gene 2.3 kilobases upstream of the 
rat Collal promoter), showing marked attenuation of osteolineage 
cells (Fig. 4a-d), and in mice after conditional EP4 deletion (Sup- 
plementary Fig. 14). Dynamic bone formation assays using staggered 
calcein double labelling and modified Goldner’s trichrome staining 
support significant attenuation of osteolineage cellular function 
(Supplementary Fig. 15). 

At present there is considerable debate about the direct or indirect 
roles of osteoclasts in haematopoietic niche regulation and HSC or 
HPC retention (reviewed in refs 12 and 13). To assess the role of 
osteoclasts, mice were treated with meloxicam and/or G-CSF with or 
without zoledronic acid, a potent inhibitor of osteoclast activity”. 
Similar to a recent report'*, zoledronic acid resulted in an increase in 
HSC or HPC mobilization by meloxicam and G-CSF (Supplementary 
Fig. 16), suggesting that increased osteoclast activity is not a mitigating 
mechanism for NSAID-mediated HSC and HPC egress. Niche attenu- 
ation and HSC or HPC mobilization by G-CSF have recently been 
reported to be mediated by marrow-resident monocyte/macro- 
phage populations’*"’. In contrast to G-CSF’*, immunohistochemi- 
cal analysis demonstrated that meloxicam does not reduce F4/80* 
macrophages (Supplementary Fig. 17a), nor is there a reduction in 
phenotypically defined macrophages assessed by flow cytometry 
(Supplementary Fig. 17b, c). We observed no changes in sinusoidal 
endothelial cell number or apoptotic state (Supplementary Fig. 18), 
nor sinusoid vessels or endothelial cell number by immunohisto- 
chemistry (Supplementary Fig. 19). Similarly, there was no alteration 
in the nestin* cell number (Supplementary Fig. 20). No differences in 
marrow MMP-9 or soluble c-kit, agents reported to regulate HSC 
motility in the bone marrow niche’’, were observed in NSAID-treated 
mice (data not shown), suggesting other unique HSC-retentive mole- 
cule(s) are regulated by EP4. 

We fractionated osteolineage cells into three sub-populations 
(Supplementary Fig. 21a). Quantitative PCR with reverse transcription 
(qRT-PCR) analysis revealed that all three populations expressed all 
four E-prostanoid receptors, with EP4 being expressed most predomi- 
nately (Supplementary Fig. 21b). Meloxicam treatment resulted in 
reductions in messenger RNA expression of several haematopoietic 
supportive molecules, including Jag1, Runx2, Vcam1, SCF (also known 
as Kitl), SDF-1 (Cxcl12), and osteopontin (Opn, also known as Spp1) 
(Supplementary Fig. 21c). Similarly, immunohistochemistry staining 
demonstrated reductions in SDF-1, OPN and N-cadherin expression 
(Fig. 4e). Analysis in EP4 conditional knockout mice showed a signifi- 
cant reduction in mesenchymal progenitor cells compared to Cre(—) 
littermates and wild-type controls (Supplementary Fig. 21d), further 
demonstrating a role for EP4 signalling in haematopoietic niche 
maintenance. 

Because the interaction of SDF-1 with its cognate receptor CXCR4 is 
a well-known mediator of niche retention, we sought to determine 
whether reduced expression of SDF-1 mediated the HSC and HPC 
egress caused by NSAID treatment. Surprisingly, despite the robust 
egress of cells in Cxcr4 conditional knockout mice, both HPC and HSC 
trafficking to the periphery were significantly enhanced by meloxicam 
(Supplementary Fig. 22). Osteopontin has been reported as a regulator 
of both HSC quiescence” and niche retention”. In contrast to CKCR4, 
when Opn knockout mice were treated with meloxicam or G-CSF for 
six days, meloxicam enhanced the mobilization of HPCs (Fig. 4f), but, 
unexpectedly, not of HSCs (Fig. 4g, h; additional data in Sup- 
plementary Fig. 23), whereas both HPCs and HSCs were mobilized 
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by G-CSF in wild-type mice. This surprising result indicates that 
NSAID-mediated OPN reduction is specifically responsible for the 
observed HSC niche egress, whereas the increased number of peri- 
pheral HPCs results from an independent mechanism(s). To deter- 
mine the differential roles of haematopoietic intrinsic versus stromal 
niche EP4 signalling in mediating HPC/HSC egress, we created chi- 
maeric mice in which we could conditionally delete EP4 from donor 
haematopoietic cells or recipient stromal cells (Fig. 4i). EP4 expression 
on haematopoietic cells was required for NSAID-mediated egress of 
HPCs (Fig. 4j), whereas EP4 on stromal cells was specifically necessary 
for HSC egress (Fig. 4k). These studies demonstrate that PGE, signal- 
ling differentially regulates HPC and HSC retention in the bone mar- 
row through both cell intrinsic and cell extrinsic mechanisms, and 
future studies should define the relative roles of individual stromal 
niche cell contributions to EP4-mediated niche retention. To our 
knowledge, this is the first report of an agent capable of mobilizing both 
HSCs and HPCs and doing so through cell-stage-specific mechanisms. 


METHODS SUMMARY 


C57Bl/6 and Opn’ ‘~ mice were purchased from Jackson Laboratories. B6.SJL- 
PtrcAPep3B/Boy] mice were bred in-house. Cxcr4"""* mice were generated as 
described”* and were a gift from Y. R. Zou. EP1 ’~ (also known as Ptgerl /~), 
EP2/~ (Ptger2 / ~), EP3/~ (Piger3 ‘~), and Ep4!*"* mice were generated as 
described****. Conditional mice were bred to Ubc-Cre/ERT2 mice from Jackson. 
Female olive baboons (Papio Anubis) were housed individually in conventional 
caging of the Biological Resources Laboratory, University of Illinois at Chicago. 
Primate research was approved by the University of Illinois Animal Care and Use 
Committee (IACUC). The IACUC of the Indiana University School of Medicine 
(IUSM) approved all protocols. The Institutional Review Board of IUSM approved 
human subject research and informed consent was acquired from all volunteers. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Animals and subjects. C57Bl/6 (CD45.2) mice were purchased from Jackson 
Laboratories (Bar Harbour). B6.SJL-PtrcAPep3B/Boy] (BOYJ) (CD45.1) mice 
were bred in-house. Cxcr4/"“""* mice were generated as described”* and were a 
gift from Y.-R. Zou. EPI ~’~ (also known as Ptger1 ~/), EP2/— (Ptger2 ‘~), 
EP3~’~ (Ptger3-/~), and Ep4!*"°" mice were generated as described’*”*. 
Opn ‘~ mice were purchased from Jackson Laboratories. Nestin-GFP”’, Col2.3- 
GFP** and Alox5 (ref. 29) mice were generated as described. Ep4!*Sx mice were 
bred to Ubc-Cre/ERT2 mice from Jackson Laboratories to generate conditional 
EP4 knockout mice. All mice were maintained on a C57B1/6 background. Female 
olive baboons (Papio Anubis), within the weight range of 16-19 kg, were housed 
individually in conventional caging and holding rooms of the Biological Resources 
Laboratory, a centralized animal facility for the University of Illinois at Chicago 
Medical Center. The conducted primate research was approved by the University 
of Illinois at Chicago Animal Care and Use Committee. The Animal Care and Use 
Committee of Indiana University School of Medicine (IUSM) approved all pro- 
tocols, and the Institutional Review Board approved human subject research. 
Informed consent was obtained from all volunteers. 

Peripheral blood and bone marrow acquisition and processing. Peripheral 
blood from mice was obtained by cardiac puncture after CO, asphyxiation using 
an EDTA-rinsed syringe. Blood was transferred to tubes containing EDTA for 
complete blood cell (CBC) analysis. CBC analysis was performed on a Hemavet 
950FS (Drew Scientific). PBMCs were prepared by centrifugation over lympholyte 
mammal (Cedarlane Laboratories) at 800 g for 30-40 min at room temperature, 
followed by triplicate washes. Bone marrow cells were collected by flushing femurs 
with ice-cold PBS, and single-cell suspensions were prepared by passage through a 
26-gauge needle. For baboons, peripheral blood was obtained from the femoral 
vein of baboons anaesthetized with an intramuscular injection of 10mgkg * 
ketamine hydrochloride (Bionichepharma). Blood was collected into 10-ml sterile 
EDTA vacutainers (Becton Dickinson) and transported on ice to IUSM for ana- 
lysis. Complete blood counts with differentials were performed on a Hemavet 
950FS. Peripheral blood was then diluted 1:3 with PBS and mononuclear cells 
were isolated using Ficoll-Paque Plus (Amersham Biosciences), per manufac- 
turer’s protocol. 

Colony assays. Bone marrow cells or PBMCs were resuspended in McCoy’s 5A 
modified media supplemented with 100U ml ' penicillin, 100 1g ml~' strep- 
tomycin, 0.6 modified essential medium (MEM) vitamin solution, 1mM 
sodium pyruvate, 0.8 MEM essential amino acids, 0.6X MEM non-essential 
amino acids, 0.05% sodium bicarbonate (all from Gibco), serine, asparagine, glu- 
tamine mixture and 15% HI-FBS (Hyclone Sterile Systems) as described*®*". Cells 
were mixed with 0.3% agar (Difco Laboratories) in McCoy’s 5A medium with 
10ngml_' rhGM-CSF and 50ngml * rmSCF (R&D Systems). PBMCs were 
cultured at 2 X 10° cells per ml and bone marrow cells at 5 X 10* cells per millilitre. 
All cultures were established in triplicate from individual animals, incubated at 
37°C, 5% CO», 5% Oz in air for seven days and colonies were quantified by 
microscopy. In some experiments, total CFC including CFU-GM, BFU-E and 
CFU-GEMM were enumerated in 1% methylcellulose/IMDM media containing 
30% FBS, 1 Uml | recombinant human erythropoietin (EPO), 10 ng ml! rhGM- 
CSF or rmGM-CSF and 50 ng ml"! rhSCF or rmSCF as described’?*?. In some 
experiments, phenotypically defined common myeloid progenitor (CMP) and 
granulocyte-macrophage progenitor (GMP) were plated at 500 cells per plate 
and colony growth determined in agar CFC assays with rmGM-CSF plus 
rmSCF or rmM-CSF. For analysis of CFC in baboons, similar assays were per- 
formed using recombinant human growth factors. 

Flow cytometry. All antibodies were purchased from BD Biosciences unless 
otherwise noted. For detection of SKL cells, we used streptavidin conjugated with 
phycoerythrin (PE)-Cy7 (to stain for biotinylated MACS lineage antibodies 
(Miltenyi)), c-kit-allophycocyanin (APC), Sca-1-PE or APC-Cy7, CD45.1-PE or 
CD45.2-FITC. For SLAM SKL, we used Sca-1-PE-Cy7, c-kit-FITC, CD150-APC 
(eBiosciences), CD48-biotin (eBiosciences) and streptavidin-PE. CXCR4 expres- 
sion was analysed using biotinylated lineage antibodies, streptavidin-PE-Cy7, 
c-kit-APC, Sca-1-APC-Cy7 and CXCR4-PE. For baboon CD34 analysis, CD34- 
PE (clone 563) was used. For macrophages, antibodies against CD115 (clone 
AFS98), Gr-1 (clone RB6-8C5) and F4/80 (clone CI:A3-1) were used. Osteo- 
lineage populations were identified and sorted as previously described’’. For enu- 
meration of bone marrow endothelial cells, femurs and tibias were crushed in a 
sterile mortar, and digested in collagenase (0.3%) at 37 °C for 1h. Recovered cells 
were co-stained with fluorochrome-conjugated antibodies against CD45, Ter119, 
Sca-1, VEGFR3 and CD31, and the total number of sinusoidal endothelial cells 
(SECs;CD45~ Terl19~ Sca-1~ VEGFR3* CD31*) per femur was enumerated by 
flow cytometry analysis. To examine endothelial cell apoptosis, gated CD45 
Terl19” Sca-1. VEGFR3* CD31~ cells were stained with annexin V (BD 
Biosciences) and LIVE/DEAD staining dye (Invitrogen). For enumeration of 


myeloid progenitors (CMP, GMP and megakaryocyte-erythroid progenitors 
(MEP)), femurs and tibias were flushed with 5 ml IMDM containing 2% FBS. 
Lineage-positive cells were depleted using the lineage-cell depletion kit 
(Miltenyi Biotec), and lineage-negative cells were stained with fluorochrome- 
conjugated antibodies against Sca-1, c-Kit, IL-7Ra, CD34 and FCRyII/Ill and 
analysed by flow cytometry. The Lin” IL-7Ra Sca-1” c-Kit” fraction was sub- 
divided into three subpopulations, CMP (FCRyII/Ie"CD34*), MEP (FCRYII/ 
ue’ CD347), and GMP (FCRYII/II"CD34 ), and collected by sorting. All flow 
cytometry analyses were performed on an LSRII flow cytometer (BD). Cell sorting 
was performed on a BD Aria or Reflection II or Reflection III sorters. 
Peripheral blood mobilization. Several different mobilization strategies were 
used, with specific details of dosing and schematics of dosing regimens shown on 
the data figures or included in the figure legends. In general, mice were given 
subcutaneous treatments of vehicle, NSAIDs (at varying doses), G-CSF (50 pig kg 
twice a day for four days), or G-CSF plus NSAIDs. For studies exploring mobilizing 
agents other than G-CSE, mice were treated with AMD3100 (5 mg kg * at day five; 
single injection), and peripheral blood was collected 1 h after AMD3100 treatment. 
For comparisons of several different NSAIDs, all NSAIDs were dosed by oral gavage 
using an enhanced oral gavage technique*’. Each gavage treatment was given in a 
0.2-ml bolus (10 ml kg ") of 0.5% methyl cellulose (Sigma-Aldrich) with an NSAID 
suspended in solution. For E-prostanoid receptor analysis, mice were mobilized 
with G-CSF in combination with meloxicam, AH6809 (EP1-3 antagonist, 10 pg per 
mouse, intraperitoneally, four days), AH23848 (EP4 antagonist, 10 1g per mouse, 
intraperitoneally, four days), L-161,982 (EP4 antagonist, 10 ug per mouse, intra- 
peritoneally, four days) or G-CSF plus meloxicam and an EP2, EP1/3 or EP4 agonist 
(10 ug per mouse, intraperitoneally, four days) or dmPGE, (10 pg per mouse, 
intraperitoneally, four days). For baboon studies, a baseline bleed was performed 
for CBC, CD34 and CFC analysis. Two days later, two baboons were treated with 
10ugkg ' G-CSF, and two baboons were treated subcutaneously with 10 ekg * 
G-CSF and 0.2 mg kg’ meloxicam on day 1, followed by 0.1 mg kg ' meloxicam 
for subsequent days, for a total of five days. Blood was collected after the treatment 
regimen for CBC, CD34 and CFC analysis. After a two-week resting period, the 
above procedure was repeated, switching treatment groups for individual baboons. 
Furthermore, after another two-week resting period, blood was collected before 
and after a five-day treatment regimen with meloxicam, and CBC, CD34 and CFC 
were analysed. For healthy volunteer studies, subjects naive to any medications 
within 30-days received a baseline bleed, followed by a second bleed after a five- 
day regimen of 15mg meloxicam per day, orally. CD34 cells were assessed by 
the International Society of Hematotherapy and Graft Engineering (ISHAGE) 
procedure* performed by the Stem Cell Laboratory of the USM Bone Marrow 
Transplant Program. CFC were assessed as described above. 

Limiting dilution competitive transplantation. CD45.1 mice were mobilized 
with a standard four-day regimen of G-CSF, or G-CSF plus a four-day regimen of 
meloxicam (6 mg kg '). In some studies designed to evaluate timing and duration 
of NSAID dosing in combination with G-CSF, initiation of the NSAID regimen 
preceded G-CSF treatement and was staggered such that NSAID administration 
ended simultaneous with the G-CSF regimen (no stagger), one day before G-CSF 
(1-day stagger) or two days before G-CSF (2-day stagger) (regimens as depicted in 
the corresponding data figure). On day 5, PBMCs were acquired and transplanted 
at 1:1, 2:1, 3:1 or 4:1 ratios with 5 X 10° C57Bl/6) WBM competitors into lethally 
irradiated C57Bl/6] recipient mice. Peripheral blood chimaerism was monitored 
monthly, and CRU and LT-HSC frequency calculated. Transplants to evaluate LT- 
HSC mobilized in Opn ‘~ mice or with EP4 antagonist were performed competi- 
tively at a 4:1 ratio; 800,000 PBMCs from CD45.2 mice versus 200,000 WBM from 
CD45.1 mice and peripheral blood chimaerism and multilineage reconstitution 
were assessed 16 weeks after transplantation. 

Recovery assay. Mice were mobilized with G-CSF or G-CSF plus meloxicam with 
staggered dosing as described above, and 2 X 10° mobilized PBMCs were trans- 
planted non-competitively into cohorts of ten lethally irradiated recipients per 
group. A cohort of non-irradiated mice was bled on the same schedule as the 
experimental-treated groups of mice. Every other day, five mice from each group 
were bled (~50 pil from a tail snip), and neutrophils and platelets in blood were 
enumerated using a Hemavet 950FS. Alternative groups of five mice were bled on 
each successive bleeding time point so that mice were only bled once every four 
days. Recovery of neutrophils and platelets to 50% and 100% were determined by 
comparison to the average neutrophil and platelet counts in the control group 
throughout the experimental period. After 90 days, mice were euthanized, bone 
marrow was collected and transplanted at a 2.5:1 ratio with 2 X 10° congenic 
competitors into lethally irradiated recipients to determine long-term repopulat- 
ing ability of the primary mobilized graft. 

EP4 chimaera generation and mobilization assay. Chimeras were generated using 
Ep4resox/flox mice and age- and sex-matched EP4/°"""* littermate controls. EP4/°"* 
mice were lethally irradiated and transplanted with 2x 10° WBM cells from either 
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Ep4te*fox mice, allowing for generation of a wild-type:wild-type chimaera, or from 
Epgretoxsex mice, allowing for generation of a knockout:wild-type chimaera. 
Similarly, EPA!” mice were lethally irradiated and transplanted with 2 x 10° 
WPM cells from EP4/"°*"°* mice, allowing for generation of a wild-type:knockout 
chimaera. At 8 weeks after transplantation, all mice were treated with 2 mg tamoxifen 
for three consecutive days, rested for three days and injected for three more days. Mice 
were then treated with G-CSF or G-CSF plus meloxicam starting ten days after the last 
treatment, and peripheral blood CFC and SLAM SKL were assessed as described. EP4 
gene deletion was confirmed by qRT-PCR. 

qRT-PCR. For E-prostanoid receptor expression on sorted osteolineage cells, total 
RNA was extracted with the purelink RNA micro kit (Invitrogen). On-column 
DNase treatment was performed according to the manufacturer’s instructions to 
eliminate contaminating genomic DNA. Conventional reverse transcription was 
followed with SuperScript III first-strand synthesis system (Invitrogen). RT-PCR 
was performed using the SYBR advantage qPCR Premix kit (Clontech) on MxPro- 
3000 (Agilent). Primers were synthesized at IDT (Supplementary Table 1). A pri- 
mer concentration of 250nM was found to be optimal in all cases. The PCR 
protocol consisted of one cycle at 95°C (5 min) followed by 45 cycles of 95°C 
(15s), 55°C (30s) and 72°C (30s). The dissociation curves were determined on 
each analysis to confirm that only one product was obtained. Expression of glycer- 
aldehyde-3-phosphate dehydrogenase (GAPDH) and hypoxanthine guanine phos- 
phoribosyl transferase (HPRT) were used as reference genes. The average threshold 
cycle number (C;) for each tested mRNA was used to quantify the relative expres- 
sion of each gene. For analysis of haematopoietic supportive molecules on sorted 
osteolineage cells from vehicle-treated or NSAID-treated mice, qRT-PCR was 
performed with the TaqMan gene expression assay kit (Life Technologies) 
(Supplementary Table 2), with complementary DNA generated from the high 
capacity cDNA reverse transcription kit (Life Technologies). Microfluidic qRT- 
PCR was performed on BioMark Dynamic Arrays according to manufacturer’s 
instructions (Fluidigm Corporation). 

Micro-computed tomography. Formalin-fixed tibiae and femora were imaged 
with micro-computed tomography using a microCT-viva 40 (Scanco Medical AG) 
with a voxel size of 10.5 um in all dimensions (” = 5). The bones were mounted in 
a cylindrical specimen holder to be captured in a single scan. Bones were secured in 
the specimen holder with gauze and were completely submerged in 70% ethanol. 
The region of interest comprised 100 transverse computed tomography slices. 
Scans with an isotropic resolution of 10.5 um were made using a 55-kV peak 
voltage X-ray beam. The fractional trabecular bone volume (BV/TV) and archi- 
tectural properties of trabecular reconstructions, apparent trabecular thickness 
(Tb.Th.), trabecular number (Tb.N.), trabecular spacing (Tb.Sp.), and connectivity 
density (Conn.D.) were calculated. 

Dynamic and static bone histomorphometry. Dynamic bone formation assays 
were performed using staggered calcein double labelling, as described previously”*. 
Bone histomorphometry was performed on 7-1m thick sections of undecalcified 
femurs embedded in methylmethacrylate using standard procedures. The mineral 
apposition rate (mm per day), mineralizing surface and bone formation rate (mm* 
mm ~ per day) were measured on femora. Modified Goldner’s trichrome staining 
procedure was performed on 7-t1m thick sections of undecalcified femurs embed- 
ded in methylmethacrylate. The osteoid surfaces as well as quiescent surfaces were 
measured on the tissue sections. Bone marrow sinusoids were visualized with anti- 
VEFGFR3 on 3.5-1um section. Vessels were identified by the positive staining 
around the vessel walls, and vessel areas were measured using the automated 
measuring system and expressed as a percentage or tissue volume. Vessel surface 
was traced with the same automated system. Vessel wall that showed an intact 
epithelial surface was expressed as endothelial surface over total vessel surface. For 
Col2.3-GFP analysis, 3.5-jum thick sections were obtained from treated Col2.3- 
GFP mice. Sections were visualized with a fluorescent microscope (Leica D100) 
using a FITC filter. Images were captured at 400 magnification at four different 
areas in the mid-shaft of the femur. GFP* osteoblasts were counted on endocor- 
tical bone surfaces, and data were expressed as the number of osteoblasts per 
endocortical bone surface. Osteoblast surface was measured and expressed as 
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the endocortical bone surface covered by osteoblasts. All histomorphometry 
experiments were done on images captured using a Leica microscope outfitted 
with Q-imaging camera (W. Nuhsbaum), and the histomorphometry analysis was 
done using Bioquant Osteo software automated measuring system (Bioquant 
Imaging Corporation). All histomorphometry values were expressed according 
to the standard nomenclature*”*. 

Immunohistochemistry. Immunohistochemical analysis was performed on 
decalcified paraffin-embedded tissue sections. Rabbit polyclonal antibodies 
against N-cadherin and SDF1 N-terminal were from Abcam. Secondary anti- 
rabbit antibodies were from Vector Laboratories. Anti-mouse OPN was from 
R&D Systems. Secondary biotinylated anti-goat horseradish peroxidase (HRP) 
conjugate, HRP-DAB system and DAV chromogen were from R&D Systems. 
HRP-rat IgG2 isotype was used as a primary antibody negative control for 
SDF-1, OPN and N-cadherin, at 1:50. Isotype staining control was performed 
under the same conditions as the antibody staining. 

COX metabolite and activity analysis. Mice were subcutaneously treated with 
vehicle control or meloxicam, twice daily. Onehour after the last treatment, 
femurs were pulled and flushed with 1 ml of ice-cold PBS, quickly brought to 
single-cell suspension and then flash frozen. COX-1- and COX-2-derived meta- 
bolites were assessed by gas chromatography—mass spectrometry as we have previ- 
ously described”. The second femur was processed in an identical way, and 
COX-1 and COX-2 activity were determined using a fluorescent COX activity 
assay following the manufacturer’s instructions (Cayman Chemicals). 


27. Mignone, J. L., Kukekov, V., Chiang, A. S., Steindler, D. & Enikolopov, G. Neural stem 
and progenitor cells in nestin-GFP transgenic mice. J. Comp. Neurol. 469, 311-324 
(2004). 

28. Kalajzic, Z. et al. Directing the expression of a green fluorescent protein transgene 
in differentiated osteoblasts: comparison between rat type | collagen and rat 
osteocalcin promoters. Bone 31, 654-660 (2002). 

29. Chen, X.S., Sheller, J. R., Johnson, E. N. & Funk, C. D. Role of leukotrienes revealed 
by targeted disruption of the 5-lipoxygenase gene. Nature 372, 179-182 (1994). 

30. King, A. G. et al. Rapid mobilization of murine hematopoietic stem cells with 
enhanced engraftment properties and evaluation of hematopoietic progenitor cell 
mobilization in rhesus monkeys by a single injection of SB-251353, a specific 
truncated form of the human CXC chemokine GROB. Blood 97, 1534-1542 
(2001). 

31. Pelus, L.M., Broxmeyer, H. E., Kurland, J. |. & Moore, M. A. Regulation of 
macrophage and granulocyte proliferation. Specificities of prostaglandin E and 
lactoferrin. J. Exp. Med. 150, 277-292 (1979). 

32. Broxmeyer, H. E. et al. SDF-1/CXCL12 enhances in vitro replating capacity of 
murine and human multipotential and macrophage progenitor cells. Stem Cells 
Dev. 16, 589-596 (2007). 

33. Fukuda, S., Bian, H., King, A. G. & Pelus, L.M. The chemokine GROB mobilizes early 
hematopoietic stem cells characterized by enhanced homing and engraftment. 
Blood 110, 860-869 (2007). 

34. Hoggatt, A. F., Hoggatt, J., Honerlaw, M. & Pelus, L. M.A spoonful of sugar helps the 
medicine go down: a novel technique to improve oral gavage in mice. J. Am. Assoc. 
Lab. Anim. Sci. 49, 329-334 (2010). 

35. Sutherland, D. R., Anderson, L., Keeney, M., Nayar, R. & Chin-Yee, I. The ISHAGE 
guidelines for CD34" cell determination by flow cytometry. J. Hematother. 5, 
213-226 (1996). 

36. Mohammad, K. S. et al. Pharmacologic inhibition of the TGF-B type | receptor 

kinase has anabolic and anti-catabolic effects on bone. PLoS ONE 4, e5275 

(2009). 

37. Rowe, P. S. et al. Correction of the mineralization defect in hyp mice treated with 

protease inhibitors CAO74 and pepstatin. Bone 39, 773-786 (2006). 

38. Parfitt, A.M. et al. Bone histomorphometry: standardization of nomenclature, 

symbols, and units. Report of the ASBMR Histomorphometry Nomenclature 

Committee. J. Bone Miner. Res. 2, 595-610 (1987). 

39. Murali, G. et al. Fish oil and indomethacin in combination potently reduce 

dyslipidemia and hepatic steatosis in LDLR-’~ mice. J. Lipid Res. 53, 2186-2197 

(2012). 

AO. Liu, T. etal. Prostaglandin Ep deficiency uncovers a dominant role for thromboxane 
Az in house dust mite-induced allergic pulmonary inflammation. Proc. Nat! Acad. 
Sci. USA 109, 12692-12697 (2012). 


©2013 Macmillan Publishers Limited. All rights reserved 


| sid ial Be 


doi:10.1038/nature11925 


NANOG-dependent function of TET1 and TET2 in 
establishment of pluripotency 


Yael Costa!, J unjun Ding”*, Thorold W. Theunissen**, Francesco Faiola?*, Timothy A. Hore’*, Pavel V. Shliaha?, Miguel Fidalgo’, 
Arven Saunders’, Moyra Lawrence’, Sabine Dietmann', Satyabrata Das®, Dana N. Levasseur®, Zhe Li’, Mingjiang Xu’, 


Wolf Reik*, José C. R. Silval?* & Jianlong Wang?* 


Molecular control of the pluripotent state is thought to reside in a core 
circuitry of master transcription factors including the homeodomain- 
containing protein NANOG”™, which has an essential role in establish- 
ing ground state pluripotency during somatic cell reprogramming**. 
Whereas the genomic occupancy of NANOG has been extensively 
investigated, comparatively little is known about NANOG-associated 
proteins’ and their contribution to the NANOG-mediated repro- 
gramming process. Using enhanced purification techniques and a 
stringent computational algorithm, we identify 27 high-confidence 
protein interaction partners of NANOG in mouse embryonic stem 
cells. These consist of 19 previously unknown partners of NANOG 
that have not been reported before, including the ten-eleven trans- 
location (TET) family methylcytosine hydroxylase TET1. We confirm 
physical association of NANOG with TET1, and demonstrate that 
TET1, in synergy with NANOG, enhances the efficiency of reprogram- 
ming. We also find physical association and reprogramming synergy 
of TET2 with NANOG, and demonstrate that knockdown of TET2 
abolishes the reprogramming synergy of NANOG with a catalytically 
deficient mutant of TET1. These results indicate that the physical 
interaction between NANOG and TET1/TET2 proteins facilitates 
reprogramming in a manner that is dependent on the catalytic activity 
of TET1/TET2. TET1 and NANOG co-occupy genomic loci of genes 
associated with both maintenance of pluripotency and lineage com- 
mitment in embryonic stem cells, and TET1 binding is reduced upon 
NANOG depletion. Co-expression of NANOG and TET1 increases 
5-hydroxymethylcytosine levels at the top-ranked common target loci 
Esrrb and Oct4 (also called Pou5f1), resulting in priming of their 
expression before reprogramming to naive pluripotency. We propose 
that TET1 is recruited by NANOG to enhance the expression of a 
subset of key reprogramming target genes. These results provide an 
insight into the reprogramming mechanism of NANOG and uncover 
a new role for 5-methylcytosine hydroxylases in the establishment of 
naive pluripotency. 

We expanded the NANOG interactome in mouse embryonic stem 
cells using an improved affinity purification and mass spectrometry 
(AP-MS) strategy®’ similar to that described previously*, combined 
with an interactomics analysis (see Supplementary Information). This 
analysis identified 27 high-confidence interaction partners of NANOG 
(Fig. 1a, b, Supplementary Figs 1-4 and Supplementary Tables 1 and 
2). Notable among the 19 previously unknown interaction partners 
of NANOG was the methylcytosine hydroxylase TET1°’° (Fig. 1b). 
Specific association of TET1 with NANOG was detected in all five 
affinity purification runs of three independent affinity purifications 
(Supplementary Fig. 5b and Supplementary Table 2), and the interaction 
between NANOG and TET1 was further confirmed by immunopre- 
cipitation and co-immunoprecipitation (Fig. lc and Supplementary 


Fig. 5c, d). Whereas NANOG clearly associates with TET 1 in embryonic 
stem cells, there also exists TET1-free NANOG protein as shown by 
immunodepleting TET1 in embryonic stem cells (Supplementary 
Fig. 5e). Notably, among the 27 high-confidence interaction partners 
of NANOG, at least 5 (NACC1, SGOL2, QSER1, HDAC2 and OCT4) 
were also associated with TET1 by co-immunoprecipitation and/or 
immunoprecipitation-mass spectrometry experiments (Supplemen- 
tary Fig. 5f-h). Expression of TET1, like that of NANOG, is upregu- 
lated during reprogramming to pluripotency (Supplementary Fig. 6a). 
Because NANOG is a critical determinant during establishment of 
pluripotency*"', we investigated whether TET1 may also be required 
for efficient nuclear reprogramming. Indeed, RNA interference- 
mediated inhibition of TET1 during reprogramming reduced gene- 
ration of induced pluripotent stem (iPS) cells from mouse embryonic 
fibroblasts (Supplementary Fig. 6b-g and Supplementary Fig. 7). The 
requirement of TET1 for efficient reprogramming was confirmed using 
Tet1 ‘~ embryonic stem cells'? in an independent, heterokaryon-based 
reprogramming system’ (Supplementary Fig. 8). 

The physical association of TET1 with NANOG prompted us to 
consider whether TET1 may modulate NANOG function in establish- 
ing pluripotency. NANOG and TET] are only minimally expressed in 
reprogramming intermediates resulting from retroviral infection of 
neural stem cells with the reprogramming factors OCT4, KLF4 and 
c-Myc (rOKM) (Fig. 2a). We addressed whether NANOG-mediated 
reprogramming of these cells requires functional contribution of 
TETI. A clonal line of reprogramming intermediates was transfected 
with a PiggyBac NANOG transgene followed by addition of short 
interfering RNA (siRNA) against Tetl (Supplementary Fig. 9a, b). 
Downregulation of TET1 reduced NANOG reprogramming efficiency 
by 26-fold compared with the non-targeting control (Fig. 2b and 
Supplementary Fig. 9c), indicating that TET1 and/or its associated 
catalytic activity may be a limiting factor for reprogramming by 
NANOG. 

We then asked if ectopic TET1 expression could enhance NANOG 
reprogramming activity. Neural stem cells + rOKM were transfected 
with PiggyBac vectors expressing NANOG, TET] or TET1 bearing 
two mutations in the catalytic domain (TET1(H1671Y, D1673A), 
hereafter TET1Mut)"® (Supplementary Fig. 10a). Individual expres- 
sion of wild-type TET1 (TET1WT) or TET1Mut did not have a 
significant effect on generation of green fluorescent protein (GFP)- 
positive colonies in cells containing a GFP gene driven by Oct4 regu- 
latory sequences (Fig. 2c and Supplementary Fig. 10b, c). In contrast, 
NANOG expression enhanced the generation of iPS cell colonies by 
more than tenfold (Fig. 2c and Supplementary Fig. 10b, c), in accord- 
ance with previous studies'’"*. Importantly, NANOG-mediated repro- 
gramming efficiency was further augmented by up to fourfold in the 
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Figure 1 | Identification of TET1 as a novel partner of NANOG. 

a, Schematic depiction of embryonic stem cells expressing NANOG with Flag 
(FL) and Biotin (BIO) tags (left), and NANOG antibody (Ab)-based affinity 
purification (right). SA1-SA3, three independent streptavidin pull-down 
experiments. b, List of 27 preys identified as true interactors ordered by 
combined cumulative probability (CCP) score. Candidates shaded in green are 
the ones whose interaction with NANOG has been validated previously*”*”° or 
in this study by immunoprecipitation/co-immunoprecipitation. Two 


presence of TETIWT transgene, a synergistic effect that is highly 
reproducible (Fig. 2c and Supplementary Fig. 10b-e). A similar repro- 
gramming synergy was also observed for the combination of NANOG 
with TET1Mut (Fig. 2c and Supplementary Fig. 10b-e). iPS cells 
derived with NANOG and either TETIWT or TET1Mut transgenes 
contributed to the germ lineage and live-born chimaeras following 
blastocyst injection (Fig. 2d, e). Together, our data show that 
NANOG and TET1 enhance the efficiency of somatic cell reprogram- 
ming in a cooperative manner. This conclusion was corroborated in 
mouse embryonic fibroblasts, where the combined action of NANOG 
and TETIWT increased reprogramming efficiency by up to 16-fold 
(Supplementary Fig. 11). 

To explore the molecular mechanism underlying the NANOG- 
TET1 partnership during reprogramming, we quantified global 
5-hydroxymethylcytosine (5hmC) levels'’. As expected, 5hmC levels 
were increased upon TET1WT but not TET1 Mut expression in neural 
stem cells + rOKM (Fig. 3a, left). Unexpectedly, co-expression of 
NANOG and either TETIWT or TET1Mut resulted in increased 
5hmC levels (Fig. 3a, left). These results indicate that NANOG can 
potentiate 5hmC modifications by its association with TET 1, and that 
transcriptional activation of endogenous TET1 and/or its paralogue 
TET2 may compensate for the lack of catalytic activity of TET1Mut 
during reprogramming with NANOG. Indeed, TET2 was upregulated 
by NANOG and TETLWT or TET1Mut, and its expression levels 
follow a very similar trend to that of 5hmC/C levels (Fig. 3a and 
Supplementary Fig. 10f, g). TET2 was identified in two out of three 
independent affinity purifications in our NANOG interactomics study 
(Supplementary Fig. 12a, b), and physical association of TET2 with 
NANOG was confirmed by immunoprecipitation/co-immunopreci- 
pitation (Supplementary Fig. 12c, d). TET2 was recently found to 
contribute to an epigenetic program that directs subsequent transcrip- 
tional induction at the pluripotency loci Nanog and Esrrb during the 
early stage of somatic cell reprogramming'®. TET1 and TET2 share the 
common carboxy-terminal catalytic domain but are divergent in their 
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previously identified NANOG partners, DAX1 and ZFP281°, were identified by 
mass spectrometry (Supplementary Fig. 5a), but not selected as high- 
confidence interactors using our stringent criteria. c, Validation of NANOG- 
TET1 interaction by immunoprecipitation/co-immunoprecipitation in 
embryonic stem (left) and HEK293T (right) cells. The asterisk indicates the 
presence of TET1 in input that can be visualized under longer exposure 
(Supplementary Fig. 5c). IP, immunoprecipitation; WB, western blot. 


amino termini for a CXXC DNA-binding domain, which renders 
TET2 functionally similar to a truncated form of TET1, TET1C 
(Fig. 3b). We investigated whether the catalytic activity of TET1 is 
sufficient to enhance NANOG-mediated reprogramming. Indeed, 
TETIC acts together with NANOG to enhance reprogramming 
(Fig. 3c) and retains its physical association with NANOG (Fig. 3d). 
Not surprisingly, we also observed reprogramming synergy between 
NANOG and TET2 (Fig. 3c and Supplementary Fig. 13). TET1 and 
TET2 function is redundant in the context of NANOG-induced repro- 
gramming, as exogenously expressing both TET enzymes together 
with NANOG does not enhance somatic cell reprogramming beyond 
expressing NANOG with either individual enzyme (Fig. 3c). 

Given that endogenous TET2 was upregulated in the presence of 
NANOG and TET1Mut (Fig. 3a right and Supplementary Fig. 10f), 
and that TET2 synergizes with NANOG during reprogramming 
(Fig. 3c), we tested whether knockdown of Tet2 could abrogate the 
reprogramming synergy of NANOG and TET. Indeed, siRNAs direc- 
ted against Tet2 diminished the reprogramming synergy of NANOG 
and TET1Mut, but did not affect that of NANOG and TETIWT 
(Fig. 3e, f). This result confirms that TET2 activation compensates 
for the lack of catalytic activity of TET1Mut during reprogramming 
with NANOG. Together, our results demonstrate that neither TET1 
nor TET2 is sufficient for the induction of pluripotency (Fig. 2c, 
Supplementary Figs 10 and 11, and Fig. 3c), but either enzyme can 
partner with NANOG to enhance reprogramming of somatic cells to 
naive pluripotency. 

We compared deposited chromatin immunoprecipitation coupled 
with DNA sequencing (ChIP-Seq) data for both NANOG’””* and 
TET1” and found a statistically significant overlap between 
NANOG and TET1 binding sites in the mouse embryonic stem cell 
genome (P <2 X 10™*, permutation test) (Fig. 4a and Supplementary 
Table 3). Gene ontology analysis revealed that genes with roles in 
‘multicellular organismal development’ and ‘positive regulation of 
transcription from Pol II promoter’ are enriched in the common 
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Figure 2 | Synergy between NANOG and TETI during reprogramming. 

a, NANOG and TET] are specifically expressed in pluripotent cells. ESC, 
embryonic stem cells; NSC, adult neural stem cells; +rOKM, transduced with 
retroviral Oct4, Klf4 and c-Myc transgenes. b, Knockdown of TET1 
compromises reprogramming activity of a constitutive NANOG transgene in 
reprogramming intermediates. siNT, non-targeting siRNA control; siTet1, 
siRNA against Tet1. Sequences for quantitative PCR primers and siRNA targets 
are listed in Supplementary Table 5. c, Both wild-type and mutant TET1 
enhance NANOG-dependent reprogramming. Quantification of the number of 
iPS colonies at day 10 of 2i+LIF (dual inhibition of mitogen-activated protein 
kinase signalling (PD0325901, 1 1M) and glycogen synthase kinase-3 (GSK3) 
(CHIR99021, 3 uM) with leukaemia inhibitory factor) treatment is shown in 
Supplementary Fig. 10b. EV, empty vector; N, NANOG. d, e, Contribution of 
iPS cells generated with NANOG and TETIWT (top) or TET1Mut (bottom) 
transgenes to the germline at E12.5 (d) and live-born chimaeras (e). Oct4-GFP, 
GFP gene driven by Oct4 regulatory sequences. Error bars indicate standard 
deviation (n = 3). 


targets (Supplementary Fig. 14). We ranked the common target genes 
of NANOG and TET1 based on the number of overlapping ChIP-Seq 
peaks in four studies (Fig. 4b and Supplementary Table 4). Among the 
common targets with the highest number of overlapping NANOG and 
TET1 peaks was Esrrb (Fig. 4b). To investigate whether NANOG may 
be required to direct TET1 to shared target genes, we used embryonic 
stem cells containing an inducible Nanog transgene in a Nanog ‘~ 
background”' (Fig. 4c, d). Loss of NANOG expression reduced TET1 
binding to a number of common targets, including Esrrb (Fig. 4e, fand 
Supplementary Fig. 15). NANOG-dependent binding of TET1 to the 
Esrrb locus seems to be independent of OCT4, as OCT4 is not present 
at the same genomic location (Fig. 4e). Thus, NANOG is responsible 
for the recruitment of TET1 to a subset of shared genomic loci that are 
implicated in both the regulation of pluripotency (for example, Esrrb) 
and lineage commitment (for example, Pax6). Such NANOG-depend- 
ent target binding of TET] is highlighted by the fact that the truncated 
form of TET1 lacking the CXXC DNA binding domain (that is, 
TETIC) maintains its physical interaction and reprogramming syn- 
ergy with NANOG (Fig. 3b-d). 

Because the functional synergy between NANOG and TET enzymes 
was dependent on catalytic activity, we examined 5hmC levels at 
NANOG/TET1 peaks in mouse embryonic stem cells. A recent study 
reported ShmC enrichment at promoter-distal NANOG binding sites 
in human embryonic stem cells** (Supplementary Fig. 16a). In con- 
trast, we observed an inverse correlation between 5h5mC and NANOG/ 
TET! binding at actively expressed target genes in mouse embryonic 
stem cells (Supplementary Fig. 16). This led us to consider whether 
5hmC may be transiently deposited to common NANOG/TETI tar- 
gets before the establishment of pluripotency, that is, during in vitro 
reprogramming when NANOG is required®. We focused on target 
gene regulation of Esrrb and Oct4, two key pluripotency genes that 
are among top ranked common targets of NANOG and TET1 
(Fig. 4b). Significantly, we observed expression priming of both 
Esrrb and Oct4 by combined expression of NANOG with TET1WT, 
TET1IMut, TETIC or TET2 in reprogramming intermediates of two 
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Figure 3 | Synergy between NANOG and TET1/TET2 during 
reprogramming is dependent upon catalytic activity of TET1/TET2. 

a, Measurement of global levels of ShmC relative to unmodified C (left, 5hmC/ 
C) and TET2 expression (right) in reprogramming intermediates transfected 
with PiggyBac (PB) transgenes. b, Schematic depiction of wild-type TET1 and 
TET2 (TET1LWT and TET2WT, respectively), and the truncated TET1 mutant 
(TET1C). Note the absence of a CXXC DNA binding domain in TET2WT and 
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TET1C proteins. CD, catalytic domain. c, Quantification of GEP* iPS colonies. 
d, Physical association of NANOG with TET1C. Co-immunoprecipitation was 
performed in HEK293T cells. HRP, horseradish peroxidase. e, TET2 
knockdown (siTet2) reduces reprogramming efficiency in intermediate cells 
transgenic for NANOG+TET1Mut compared to NANOG+TET1WT. Non- 
targeting siRNA (siNT) serves as a control. f, Quantification of the number of 
iPS colonies in (e). Error bars indicate standard deviation (n = 3). 
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Figure 4 | Mechanism and genome-wide significance of the NANOG-TET1 
interaction. a, Scatterplot showing the observed against expected overlap in 
genomic binding sites of NANOG and TET! according to comparisons 
performed in Supplementary Table 3. b, Ranked list of common targets of 
NANOG and TET1 based on the comparisons in Supplementary Table 3. 

c, Schematic representation of embryonic stem cells harbouring a doxycycline 
(Dox)-suppressible NANOG transgene in a Nanog '~ genetic background”. 
d, Western blot analysis of OCT4, NANOG, and TET1 expression in NgcKO 


independent cellular systems (Fig. 4g and Supplementary Figs 11g and 
13b). More importantly, we detected increased 5hmC and decreased 
5-methylcytosine (5mC) levels at these loci when NANOG is co- 
expressed with TET1 (Fig. 4h). Thus, NANOG and TET! act before 
the transition to naive pluripotency by inducing local transcriptional 
changes in shared target genes that are critically involved in the regu- 
lation of pluripotency. 

In summary, we identified 5mC hydroxylases TET1 and TET2 as 
novel interaction partners of NANOG. TET1/TET2 and NANOG 
synergistically enhance the efficiency of reprogramming and this 
phenotype is dependent on the hydroxylation of 5mC to ShmC during 
somatic cell reprogramming. This study thus provides mechanistic 
insight into how NANOG establishes pluripotency, demonstrating 
that interactions between NANOG and epigenetic regulators fine-tune 
induced pluripotency. Future experimental work is needed to delineate 
the precise composition of NANOG-TET1/TET2 protein complexes, 
and the contribution of other interaction partners to the reprogram- 
ming mechanism described herein. Our work supports an emerging 
view that TET proteins can overcome epigenetic roadblocks during 
reprogramming and transdifferentiation’*”. 


METHODS SUMMARY 

Affinity purification coupled with mass spectrometry (AP-MS). Nuclear 
extraction and affinity purification of Flag-biotin-NANOG-associated protein 
complexes were performed as previously described’, with several modifications 
as described’. Three biological replicates were performed for streptavidin agarose- 
based affinity purification and one each for Flag and NANOG antibody-based 
affinity purifications. Liquid chromatography coupled with tandem mass spectro- 
metry (LC-MS/MS) was used by the Taplin Biological Mass Spectrometry Facility 
at Harvard Medical School to sequence and identify NANOG affinity purification 
samples. 

Reprogramming assays. To investigate the consequences of NANOG and TET1 
co-expression during reprogramming, adult neural stem cells were infected with 
pMxX-based retroviral reprogramming factors”. Cultures were switched to embry- 
onic stem cell medium (serum/leukaemia inhibitory factor (LIF)) at day 3 post- 
transduction. A clonal line of proliferative, Oct4-GFP negative cells (reprogramming 


embryonic stem cells treated with (+) or without (—) Dox. e, Overlapping 
peaks of TET1 and NANOG from ChIP-Segq studies'””” in the Esrrb locus. 

f, Relative enrichment of TET1 in the absence (-) and presence (+) of Dox in the 
Esrrb genomic locus as shown in (e). g, Transcriptional priming of Esrrb and Oct4 
by NANOG and TET] in reprogramming intermediates. h, Relative enrichment 
of 5hmC and 5mC in the Esrrb and Oct4 loci. Error bars indicate standard 
deviation (n = 3). Primers used for glucosylated hydroxymethyl-sensitive qPCR 
are listed in Supplementary Table 6. 


intermediates) was transfected using nucleofection (Amaxa) with various combina- 
tions of NANOG and TET! PiggyBac transgenes. Selection for stable transgene 
expression was applied to transfectants for a minimum of 12 days and maintained 
until medium switch to 2i (dual inhibition of mitogen-activated protein kinase 
signalling (PD0325901, 1M) and glycogen synthase kinase-3 (GSK3) 
(CHIR99021, 31M))+LIF. Puromycin selection for an Oct4-GFP-IRES-puro 
reporter transgene was applied from day 6 of 2i+LIF treatment. GFP-positive 
colonies were scored at day 10. Similar reprogramming assays were applied to 
Nanog-GFP-IRES-puro reporter mouse embryonic fibroblasts with modifications 
described in Supplementary Information. 
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Multiple phases of chondrocyte enlargement 
underlie differences in skeletal proportions 


Kimberly L. Cooper'*, Seungeun Oh?*, Yongjin Sung®, Ramachandra R. Dasari*, Marc W. Kirschner? & Clifford J. Tabin’ 


The wide diversity of skeletal proportions in mammals is evident 
upon a survey of any natural history museum’s collections and 
allows us to distinguish between species even when reduced to their 
calcified components. Similarly, each individual is comprised of a 
variety of bones of differing lengths. The largest contribution to the 
lengthening of a skeletal element, and to the differential elongation 
of elements, comes from a dramatic increase in the volume of hyper- 
trophic chondrocytes in the growth plate as they undergo terminal 
differentiation’. However, the mechanisms of chondrocyte volume 
enlargement have remained a mystery* "*. Here we use quantitative 
phase microscopy” to show that mammalian chondrocytes undergo 
three distinct phases of volume increase, including a phase of mas- 
sive cell swelling in which the cellular dry mass is significantly 
diluted. In light of the tight fluid regulatory mechanisms known 
to control volume in many cell types’’, this is a remarkable mecha- 
nism for increasing cell size and regulating growth rate. It is, how- 
ever, the duration of the final phase of volume enlargement by 
proportional dry mass increase at low density that varies most 
between rapidly and slowly elongating growth plates. Moreover, 
we find that this third phase is locally regulated through a mecha- 
nism dependent on insulin-like growth factor. This study provides a 
framework for understanding how skeletal size is regulated and for 
exploring how cells sense, modify and establish a volume set point. 

Each of the long bones initially forms in the embryo as a similarly 
sized cartilage rudiment that only subsequently undergoes differential 
regulation of growth. The elongation ofa skeletal element occurs at the 
growth plate’*”’, each consisting of three distinct zones: resting round 
chondrocytes near the end of an element give rise to clonal columns of 
flattened proliferating chondrocytes that then terminally differentiate 
into hypertrophic chondrocytes nearest the bony centre of an element. 
Although multiple cellular parameters contribute to lengthening of 
skeletal elements—including proliferation, matrix deposition and 
hypertrophic cell enlargement—the greatest contribution to growth 
rate in mammals is due to the massive volume enlargement of hyper- 
trophic chondrocytes expanding the skeletal tissue in the direction of 
longitudinal growth within laterally restricting matrix channels’*. In 
addition to being the largest contributor to the elongation rate of a 
given skeletal element, this parameter is largely responsible for the 
difference in growth rates between different skeletal elements within 
an individual, between homologous elements in different species, and 
within a single element as an animal ages*’. Surprisingly, however, 
given its critical importance in determining the growth rate of each 
bone and the overall stature of the individual, the mechanism is poorly 
understood. It even remains unclear whether hypertrophic chondro- 
cyte volume increases by true hypertrophy, maintaining constant den- 
sity during growth through an increase in macromolecules and 
organelles, or by cell swelling through disproportionate fluid uptake, 
which is ordinarily a hallmark of disease’’. 

To determine whether hypertrophic chondrocytes enlarge by cell- 
ular hypertrophy and/or swelling, we applied methods of diffraction 


phase microscopy to measure the dry mass of individual unstained live 
cells dissociated from growth plate cartilage. (Fig. 1a and Supplemen- 
tary Methods)'*. Together with volume information about the sample, 
calculated here based on a well-supported spherical approximation for 
dissociated chondrocytes (Supplementary Methods), this allows the 
calculation of dry mass density. Diffraction phase microscopy mea- 
surements of a variety of cell types consistently reflect a ‘normal’ dry 
mass density for healthy living cells at approximately 0.182 pg per 
femtolitre (fl) in agreement with the concentration of cytoplasm previ- 
ously determined by index matching in human oral epithelial cells”. 
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Figure 1 | Hypertrophic chondrocytes increase in volume through three 
distinct phases including a phase of massive cell swelling. a, Quantitative 
phase image of dissociated mouse proximal tibia hypertrophic chondrocytes at 
postnatal day 5 (P5). Colour bar represents phase shift in radians. b, Volume 
versus dry mass plotted for individual chondrocytes. Linear regression for cells up 
to 1,000 fl highlights divergence of larger cells from an initial slope of 0.183 pg fl’. 
c, d, Log scale (c) and linear scale (d) plots of volume versus dry mass density. 
Lambda for the smoothened spline in (c) and (d) is 5 X 10°. R? value is 0.84. 
n= 1,249 cells summed across five independent experiments. e, f, Horizontal 
cross sections from regularized tomographic phase microscopy density 
reconstructions of a small (e) and large (f) mouse tibia chondrocyte. Colour bar 
represents refractive index and thus dry mass density. Scale bars, 10 um. 
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This includes maturing megakaryocytes that reach volumes compar- 
able to the largest hypertrophic chondrocytes and ten times the aver- 
age somatic cell volume (Supplementary Fig. 1). 

In contrast, analysis of chondrocytes from the rapidly elongating 
mouse proximal tibia reveals that there are three distinct phases of 
hypertrophic cell enlargement. In phase 1, an initial increase of about 
threefold from approximately 600 fl to 2,000 fl is characterized by 
true hypertrophy—a proportionate increase in dry mass production 
and fluid uptake thus maintaining the normal dry mass density at 
0.183 pg fl‘ (Fig. 1b, c). In phase 2, a fourfold enlargement from about 
2,000 fl to 8,000 fl is characterized by cell swelling. Volume increases at 
a rate disproportionate to the continuing rate of dry mass production 
resulting in a dramatic dilution of dry mass density to approximately 
0.07 pg fl” 1 (Fig. 1c). In phase 3, at volumes larger than 8,000 fl, the dry 
mass density once again stabilizes, and cells continue to enlarge 
another twofold to about 14,000 fl by proportionately increasing dry 
mass and fluid volume at this lower density (Fig. 1d). Swelling in phase 
2 allows cells to reach volumes two to three times greater in phase 3 
than if they relied entirely on the proportionate increase in dry mass at 
high density (Fig. 1b, linear regression). 

To verify that this decrease in density is characteristic of hyper- 
trophic differentiation, we imaged a subset of the small high-density 
cells and large low-density cells using regularized tomographic phase 
microscopy to generate a refractive index map of dry mass density in 
three-dimensions (Supplementary Methods). This independent 
approach confirms that the largest cells reduce their dry mass density 
by approximately 60%, and moreover indicates that dry mass is low 
throughout the cytoplasm with a slightly higher density ring around 
the nucleus (Fig. le, f and Supplementary Fig. 14). 

Understanding the cellular process by which hypertrophic cells 
enlarge provides a framework for considering how that process is 
modulated to achieve differential growth of individual elements within 
a species and of homologous elements between species. In contrast to 
the large hypertrophic chondrocytes of the rapidly elongating proxi- 
mal tibia, the slowly elongating proximal radius has much smaller 
hypertrophic chondrocytes’. We find that these cells go through phase 
1 and enter phase 2, similar to the cells of the proximal tibia, reaching a 
volume of approximately 5,000 fl. However, they truncate the remain- 
der of phase 2 after dilution of dry mass density to approximately 
0.10 pg fl and completely eliminate phase 3 (Fig. 2a). 

We next compared growth plates of the mouse to those of the lesser 
Egyptian jerboa, Jaculus jaculus, a small bipedal rodent with greatly 
elongated hindlimbs. In particular, the metatarsals of jerboa feet 
rapidly elongate during early postnatal development to approximately 
2.5 times the relative proportion of mouse metatarsals’*. The mouse 
distal metatarsal growth plate is intermediate in its growth rate and 
hypertrophic chondrocyte size compared to the mouse proximal tibia 
and radius (Fig. 2b and Supplementary Fig. 2). Although other aspects 
of the jerboa distal metatarsal growth plate are also altered, including 
the total cell number in each zone (Supplementary Fig. 3), the height of 
individual hypertrophic chondrocytes is increased by 58% compared 
to the metatarsal of the mouse, suggesting a significant contribution of 
hypertrophic chondrocyte volume to the increased rate of growth of 
this element (Fig. 2b, e, f). In contrast, the jerboa proximal tibia hyper- 
trophic chondrocytes are only slightly larger than their counterparts in 
the mouse (Fig. 2b-d). Unlike the metatarsals, the metacarpals of the 
jerboa forelimb, as well as other bones of the forelimb, are similar in 
size and proportions to those of the mouse with hypertrophic chon- 
drocytes of comparable size (Fig. 2b). 

The hypertrophic chondrocytes of the jerboa proximal tibia show 
very similar growth properties to those of the mouse when examined 
by diffraction phase microscopy, including all three phases of volume 
enlargement (Supplementary Fig. 4). However, there is a striking dif- 
ference between the metatarsal chondrocytes of the two species. 
Although mouse metatarsal hypertrophic chondrocytes are indeed 
intermediate in size between the proximal radius and proximal tibia, 
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reaching a maximum volume of about 8,000 fl by completing phases 1 
and 2 and truncating phase 3 (Fig. 2g), hypertrophic chondrocytes of 
the jerboa metatarsals increase almost 40-fold from their initial volume 
to approximately 23,000 fl, greater than the volume of tibia chondro- 
cytes in either species (Fig. 2h). This is accomplished by following the 
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Figure 2 | Differences in cell size associated with different skeletal growth 
rates are attributed to modulating a common growth trajectory. 

a, Dissociated postnatal day 5 (P5) mouse proximal radius chondrocytes (black, 
n = 292 cells) compared to proximal tibia chondrocytes (grey in all panels, data 
from Fig. 1). The x axis is in log scale. b, Quantification of average maximum 
cell height = s.e.m. for n = 3 animals of each species at P7 (>50 cells per growth 
plate). Two-tailed student’s t-test shows significant differences in the tibia and 
metatarsal growth plates between the two species (P< 10°) but not in the 
metacarpals (P = 0.978). MC, metacarpal growth plate; MT, metatarsal growth 
plate. c-f, Histological comparison of mouse and jerboa proximal tibia and 
distal metatarsal hypertrophic zones at P7. Scale bar, 50 jum. g, Dissociated P5 
mouse metatarsal chondrocytes (black, n = 634 cells) compared to mouse tibia 
chondrocytes. Black arrowhead approximates the end of the metatarsal 
distribution. h, Jerboa metatarsal chondrocytes (black, n = 366 cells) compared 
to mouse tibia chondrocytes. Grey arrowhead approximates the end of the 
mouse tibia distribution. 
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Figure 3 | Igfl is required for phase 3 of volume enlargement by dry mass 
production at low dry mass density. a, b, Histology of postnatal day 7 (P7) 
mouse proximal tibia hypertrophic zone of control (a, I; lof *; Hoxb6-Cre*) 
and Igf1 conditional mutant animals (b, [, lof if. Hoxb6-Cre* ). Scale bar, 50 um. 
c, Bar plot demonstrating an average maximum cell height reduction of 34% in 
Igfl conditional mutant tibia and 23% in metatarsal chondrocytes compared to 
control littermates. Error bars indicate + s.e.m. for n = 3 animals of each 
genotype (total = >80 cells per growth plate). d, Dry mass versus volume plot 
of dissociated P5 Igfl mutant proximal tibia hypertrophic chondrocytes. Red 
data points are Igfl mutant chondrocytes (n = 569 cells), black are littermate 
control chondrocytes (n = 373 cells) and grey represent the total wild-type data 
set for tibia (d, e) and metatarsal (f) (wild-type data from Figs 1 and 2). e, f, Dry 
mass density versus volume plots for Igfl mutant tibia chondrocytes (n = 569 
cells) (e) and Igf1 mutant metatarsal chondrocytes (n = 412 cells) (f). 


same triphasic growth trajectory common to chondrocytes of other 
growth plates and then extending phase 3 to reach a maximal volume 
by continued proportionate increase in dry mass and fluid volume at 
low dry mass density. 

Little is known about the molecular mechanisms that control chon- 
drocyte enlargement, or the regulation of final cell size, as few of the 
mutants affecting skeletal development have been examined for size of 
individual hypertrophic chondrocytes. An intriguing exception is the 
null mutation of the gene encoding insulin-like growth factor 1 
(Igf1)'’. Igfl functions in a variety of target tissues to promote protein 
synthesis and cell growth’*” and is strongly expressed in both proli- 
ferating and pre-hypertrophic chondrocytes. Igfl-deficient mice are 
35% smaller than controls but have the same number of hypertrophic 
chondrocytes, although each cell is 30% shorter in the direction of 
elongation, a finding we confirmed in mice where the floxed 
Igft'”"'?"" allele” was conditionally deleted from the hindlimb using 
Hoxb6-Cre (ref. 22) (Fig. 3a—c). It is additionally intriguing to note that 
there is no distinction between the heights of chondrocytes in the 
proximal tibia and distal metatarsal of this mutant (Fig. 3c), indicating 
that Igfl may play an important role in the establishment of growth- 
plate-dependent cell size. Although this effect could be modulated by 
any member of the Igf1 signalling pathway, evidence suggests a pos- 
sible role for the receptor, Igf1R, which maintains higher levels of 
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Figure 4 | Mouse proximal tibia and distal metatarsal hypertrophic 
chondrocytes rapidly increase in average cell height. Time course of the 
average and standard deviation of BrdU labelled cell height indicating the rate 
of chondrocyte size increase after the last mitotic cycle. n > 25 cells from three 
individuals for each time point and growth plate. 


expression over time in growth plates that continue to elongate at 
faster rates in maturing mice”. 

We employed diffraction phase microscopy to determine which 
phase(s) of volume enlargement are affected by Igfl. Igfl-deficient 
hypertrophic chondrocytes undergo normal phase 1 and phase 2 of 
enlargement, reaching approximately 7,000 fl largely by cell swelling, 
but do not progress to phase 3 and thus fail to further double their 
volume by the continued production of dry mass at low density 
(Fig. 3d-f). Taken together, our results indicate that there are three 
distinct phases of chondrocyte hypertrophy, and it is regulation of the 
Igfl-dependent third phase that is responsible for much of the vari- 
ation in skeletal elongation rate. 

Finally, previous studies of the neonatal bat and mouse forelimb 
indicated that the entire hypertrophic zone of each growth plate turns 
over once in about 24 h regardless of the maximum volume attained by 
individual chondrocytes, the number of hypertrophic chondrocytes, or 
the rate of growth plate elongation’. This suggests that growth plates 
elongating at different rates adjust the rate of cell volume increase to 
fall within a 24-h lifespan constraint. To assess the pace of cellular 
maturation and enlargement, we marked proliferating mouse chon- 
drocytes with 5-bromodeoxyuridine (BrdU) and followed the progres- 
sion of the first labelled cells to emerge into the post-mitotic 
hypertrophic columns onward to the chondro-osseous junction (Sup- 
plementary Fig. 5). We find that the rate of cell height increase in the 
proximal tibia and distal metatarsal growth plates is extremely rapid, 
more than tripling the height of proximal tibia chondrocytes within 
approximately 12 h (Fig. 4). Once cells reach their average final height, 
with a steeper slope in the larger cells of the tibia, they remain at this 
terminal size in the hypertrophic growth columns for an additional 
12h before turnover at the chondro-osseous junction. 

Through coordination of this multiphase process, cell swelling 
allows chondrocytes to enlarge extraordinarily rapidly while presum- 
ably lowering the energetic cost of growth, and volumes are subse- 
quently amplified in the most rapidly elongating skeletal elements by 
the continued Igfl-dependent production of mass. This unique mech- 
anism of volume enlargement suggests that chondrocyte hypertrophy 
will serve as a valuable model for cell volume homeostasis, in addition 
to our findings that provide insight into skeletal morphogenesis and 
evolution. 


METHODS SUMMARY 


The distal growth plate adjacent to the chondro-osseous junction was micro- 
dissected from P5 mouse and jerboa pups and dissociated for approximately 4h 
in 2mgml ' collagenase D at 37 °C. Cells were transferred to a 14-mm diameter 
glass-bottom dish for diffraction phase imaging. The specifications of the diffrac- 
tion phase microscope are provided in the Supplementary Methods. The interfer- 
ence image, captured by charge coupled device (CCD), was used to compute a 
quantitative phase image in MATLAB. See Supplementary Methods for the 
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computation algorithm. The quantitative phase is converted to dry mass using the 
specific refractive increment value of 0.18 mlg_'. For a subpopulation of cells, 
sphericity was determined using three-dimensional confocal morphometry and 
the refractive index contrast method (Supplementary Figs 12 and 13). As disso- 
ciated chondrocytes are spherical, we use the volume calculated from measured 
cell diameter for cells analysed by diffraction phase microscopy. Dry mass density 
was calculated by dividing the total cell dry mass by its volume. Dry mass density of 
a subset of cells was confirmed using regularized tomographic phase microscopy 
(Supplementary Fig. 14). See Supplementary Methods for details. 


Full Methods and any associated references are available in the online version of 
the paper. 


Received 18 March 2012; accepted 29 January 2013. 
Published online 13 March 2013. 


1. Wilsman, N. J., Farnum, C. E., Leiferman, E. M., Fry, M. & Barreto, C. Differential 
growth by growth plates as a function of multiple parameters of chondrocytic 
kinetics. J. Orthop. Res. 14, 927-936 (1996). 

2. Hunziker, E. B., Schenk, R. K. & Cruz-Orive, L. M. Quantitation of chondrocyte 
performance in growth-plate cartilage during longitudinal bone growth. J. Bone 
Joint Surg. Am. 69, 162-173 (1987). 

3. Hunziker, E.B.& Schenk, R.K. Physiological mechanisms adopted by chondrocytes 
in regulating longitudinal bone growth in rats. J. Physiol. (Lond.) 414, 55-71 (1989). 

4. Breur, G. J., VanEnkevort, B. A, Farnum, C. E. & Wilsman, N. J. Linear relationship 
between the volume of hypertrophic chondrocytes and the rate of longitudinal 
bone growth in growth plates. J. Orthop. Res. 9, 348-359 (1991). 

5. Kuhn, J. L., Delacey, J. H. & Leenellett, E. E. Relationship between bone growth rate 
and hypertrophic chondrocyte volume in New Zealand white rabbits of varying 
ages. J. Orthop. Res. 14, 706-711 (1996). 

6. Wilsman, N. J., Bernardini, E. S., Leiferman, E., Noonan, K. & Farnum, C. E. Age and 
pattern of the onset of differential growth among growth plates in rats. J. Orthop. 
Res. 26, 1457-1465 (2008). 

7. Farnum, C. E., Tinsley, M. & Hermanson, J. W. Forelimb versus hindlimb skeletal 
development in the big brown bat, Eptesicus fuscus: functional divergence is 
reflected in chondrocytic performance in autopodial growth plates. Cells Tissues 
Organs 187, 35-47 (2008). 

8. Buckwalter, J. A., Mower, D., Ungar, R., Schaeffer, J. & Ginsberg, B. Morphometric 
analysis of chondrocyte hypertrophy. J. Bone Joint Surg. Am. 68, 243-255 (1986). 

9. Farnum, C. E., Lee, R., O'Hara, K. & Urban, J. P. G. Volume increase in growth plate 
chondrocytes during hypertrophy: the contribution of organic osmolytes. Bone 
30, 574-581 (2002). 

10. Bush, P. G., Parisinos, C. A. & Hall, A.C. The osmotic sensitivity of rat growth plate 
chondrocytes in situ; clarifying the mechanisms of hypertrophy. J. Cell. Physiol. 
214, 621-629 (2008). 

11. Bush, P. G., Pritchard, M., Loqman, M. Y., Damron, T. A. & Hall, A. C. A key role for 
membrane transporter NKCC1 in mediating chondrocyte volume increase in the 
mammalian growth plate. J. Bone Miner. Res. 25, 1594-1603 (2010). 


378 | NATURE | VOL 495 | 21 MARCH 2013 


12. Barer, R. Interference microscopy and mass determination. Nature 169, 366-367 

(1952). 

13. Hoffmann, E. K., Lambert, |. H. & Pedersen, S. F. Physiology of cell volume 

regulation in vertebrates. Physiol. Rev. 89, 193-277 (2009). 

14. Hunziker, E. B. Mechanism of longitudinal bone growth and its regulation by 

growth plate chondrocytes. Microsc. Res. Tech. 28, 505-519 (1994). 

15. Kronenberg, H. M. Developmental regulation of the growth plate. Nature 423, 

332-336 (2003). 

16. Popescu, G., Ikeda, T., Dasari, R. R. & Feld, M. S. Diffraction phase microscopy for 

quantifying cell structure and dynamics. Opt. Lett. 31, 775-777 (2006). 

17. Barer, R. Determination of dry mass, thickness, solid and water concentration in 

Living Cells. Nature 172, 1097-1098 (1953). 

18. Cooper, K.L. The lesser Egyptian jerboa, Jaculus jaculus: a unique rodent model for 

evolution and development. Cold Spring Harb. Protocols 2011, pdb.emo066704 

(2011). 

19. Wang, J., Zhou, J. & Bondy, C. A. lgfl promotes longitudinal bone growth by 
insulin-like actions augmenting chondrocyte hypertrophy. FASEB J. 13, 1985-1990 
(1999). 

20. Oldham, S. & Hafen, E. Insulin/IGF and target of rapamycin signaling: a TOR de 
force in growth control. Trends Cell Biol. 13, 79-85 (2003). 

21. Yakar, S. Normal growth and development in the absence of hepatic insulin-like 
growth factor I. Proc. Natl Acad. Sci. USA 96, 7324-7329 (1999). 

22. Lowe, L. A., Yamada, S. & Kuehn, M. R. HoxB6-Cre transgenic mice express Cre 
recombinase in extra-embryonic mesoderm, in lateral plate and limb mesoderm 
and at the midbrain/hindbrain junction. Genesis 26, 118-120 (2000). 

23. Serrat, M. A., Lovejoy, C. O. & King, D. Age- and site-specific decline in insulin-like 
growth factor-I receptor expression is correlated with differential growth plate 
activity in the mouse hindlimb. Anat. Rec. (Hoboken) 290, 375-381 (2007). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements We would like to thank T. J. Mitchison, C. E. Farnum and members 
of the Developmental Bone Morphogenesis program project grant (National Institutes 
of Health (NIH)) for helpful discussions. We also thank the Nikon Imaging Center at 
Harvard Medical School for technical support, A. Luyten and R. Shivdasani for providing 
mouse megakaryocytes and P. Ramirez for jerboa care. This work was supported by 
NIH grants PO1DKO056246 to CJ.T.; RO1GM026875 to M.W.K.; and by NIH grant 
P41RR02594, National Science Foundation (NSF) grant DBIO754339 and support 
rom the Hamamatsu Corporation to R.R.D. 


Author Contributions K.L.C. and S.O. conceived the project and carried out most of the 

experiments. Y.S. and R.R.D. carried out critical tomographic experiments validating 

he primary approaches taken. C.J.T. and M.W.K. supervised the project. K.L.C., S.0., 
W.K. and C.J.T. wrote the manuscript. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to K.L.C. 
(kcooper@genetics.med.harvard.edu). 


©2013 Macmillan Publishers Limited. All rights reserved 


METHODS 


Animals. CD-1 was chosen as the wild-type mouse strain for this study. Igf1 
conditional mice” and Hoxb6-Cre transgenic mice were previously described. 
Jerboas were housed and reared as previously described**. All animal protocols 
were approved by the Harvard Medical Area Standing Committee on Animals. 
Sectioning and histology. Dissected skeletal elements were fixed overnight at 4 °C 
in 4% paraformaldehyde and then carried through a graded series of ethanol 
dehydration washes before transition through xylenes and into paraffin wax. 
Sections were cut at 10-12-|1m thickness and stained with haematoxylin and eosin. 
Average maximum cell heights were measured in the axis of linear growth through 
the lacunae surrounding the largest cells with a clear nuclear profile from digital 
images of the hypertrophic zone and averaged across at least 4 sections from at least 
3 individuals. BrdU (100 mg kg *) or oxytetracycline hydrochloride (20 mg kg) 
was injected into the peritoneum of P5 mice before harvest. BrdU was detected 
using a rat anti-BrdU antibody (AbD, Serotech) followed by goat anti-rat Alexa594 
(Invitrogen) in paraffin sections. Oxytetracycline was detected by fluorescence in 
bisected skeletal elements. 

Chondrocyte and megakaryocyte isolation. P5 animals were chosen for this 
study because the tibia and metatarsal growth plates are rapidly elongating in both 
species, but the metatarsal epiphysis (secondary ossification centre) has not yet 
formed. While the epiphysis of the metatarsal forms by P7 in the mouse, it appears 
later in the jerboa (Supplementary Fig. 3). As the hypertrophic chondrocytes reside 
in a small domain nearest the chondro-osseous junction, we enriched for these 
cells by using a razor blade to remove a majority of the cartilage containing resting 
and proliferative chondrocytes leaving the cells closest to the chondro-osseous 
junction and a small amount of the adjacent trabecular bone. Growth plates were 
bisected longitudinally and incubated for 45 min at 37 °Cin 2 mgml ' collagenase 
D (Roche) in DMEM/F12 (Invitrogen, 290-330 mOsm) plus 10% fetal calf serum. 
After the initial incubation, the bone collar, trabecular bone and loosened con- 
nective tissues were manually removed with forceps, and the remaining cartilage 
fragments were transferred to a fresh dish of collagenase digestion media. Cells 
were incubated for an additional 2-3 h with occasional swirling until cells mostly 
dissociated from the surrounding matrix. Dissociated chondrocytes were trans- 
ferred to 35-mm dishes with a 14-mm diameter, 1.5-thickness glass bottom 
(MatTek) and imaged immediately (see Supplementary Methods for details of 
imaging). The largest hypertrophic chondrocytes are a small population of the 
most mature cells, therefore we further enriched for this population in our data 
analysis by scanning for fields of view containing the largest cells in the dish and 
quantified all of the intact spherical neighbours. 
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There has been a longstanding discussion in the literature regarding the extra- 
cellular osmolarity of chondrocytes, primarily in the articular cartilage?'°*”®. 
Much of this data are based on the theoretical ionic environment in association 
with charged glycosaminoglycans according to the Donnan-Gibbs equilibrium, 
although to our knowledge the extracellular osmolarity of growth plate cartilage 
has not been directly measured. While serum osmolarity is approximately 
280mOsm, the osmolarity in association with cartilage may be upwards of 
400 mOsm. To address the possibility that the swelling we observe in chondrocytes 
at larger volumes may be the response to media of low osmolarity, we repeated the 
diffraction phase microscopy measurements in mouse proximal tibia growth 
plates dissociated in 424 mOsm DMEM/F12 raised with sucrose. Media osmolar- 
ity was measured using the Vapro Model 5600 (Wescor). We find the same three 
phases, including the phase of cell swelling, indicating this is an inherent property 
of growth plate chondrocytes and not the passive response to an abnormal osmotic 
environment (Supplementary Fig. 6). We presume that the 4h from dissection to 
imaging is enough time for volume regulatory mechanisms to compensate for any 
response to osmotic stress that may have occurred. Indeed, 4D confocal imaging of 
in situ porcine articular chondrocytes after osmotic stress shows a mean recovery 
rate of 4.1% + 1.8% per min with 96% volume recovery after about 12 min’’. 

Fetal megakaryocytes were isolated from embryonic day 14.5 mouse livers 
according to previously published protocols” or generously donated by A. 
Luyten and R. Shivdasani. Cells were imaged by diffraction phase microscopy 
one day after harvest or at maturity after five days in culture. 
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Brown-fat paucity due to impaired BMP signalling 
induces compensatory browning of white fat 


Tim J. Schulz'}, Ping Huang’, Tian Lian Huang', Ruidan Xue’, Lindsay E. McDougall!, Kristy L. Townsend', Aaron M. Cypess', 


Yuji Mishina*, Emanuela Gussoni* & Yu-Hua Tseng"” 


Maintenance of body temperature is essential for the survival of 
homeotherms. Brown adipose tissue (BAT) is a specialized fat tis- 
sue that is dedicated to thermoregulation’. Owing to its remark- 
able capacity to dissipate stored energy and its demonstrated 
presence in adult humans”*, BAT holds great promise for the 
treatment of obesity and metabolic syndrome’. Rodent data sug- 
gest the existence of two types of brown fat cells: constitutive BAT 
(cBAT), which is of embryonic origin and anatomically located in 
the interscapular region of mice; and recruitable BAT (rBAT), 
which resides within white adipose tissue (WAT)® and skeletal 
muscle’, and has alternatively been called beige’, brite’ or inducible 
BAT”. Bone morphogenetic proteins (BMPs) regulate the forma- 
tion and thermogenic activity of BAT'®’*. Here we use mouse 
models to provide evidence for a systemically active regulatory 
mechanism that controls whole-body BAT activity for thermore- 
gulation and energy homeostasis. Genetic ablation of the type 1A 
BMP receptor (Bmprla) in brown adipogenic progenitor cells 
leads to a severe paucity of cBAT. This in turn increases sympath- 
etic input to WAT, thereby promoting the formation of rBAT 
within white fat depots. This previously unknown compensatory 
mechanism, aimed at restoring total brown-fat-mediated thermo- 
genic capacity in the body, is sufficient to maintain normal tem- 
perature homeostasis and resistance to diet-induced obesity. These 
data suggest an important physiological cross-talk between con- 
stitutive and recruitable brown fat cells. This sophisticated regu- 
latory mechanism of body temperature may participate in the 
control of energy balance and metabolic disease. 

It has recently become clear that cBAT shares a common deve- 
lopmental ancestry with skeletal muscle'*"*, whereas rBAT, localized 
within white fat or skeletal muscle, is derived from a non-myogenic 
lineage’*’. Lineage-tracing experiments have also revealed that cCBAT 
arises from progenitors located in the embryonic dermomyotome** 
that express the myogenic markers Pax7 and Myf5 (refs 13, 16). 
Thus, we generated a mouse model lacking BMPRIA in all cells des- 
cending from the Myf5* lineage (Myf5-BMPRI1A-KO). No apparent 
changes in morphology, proliferation or apoptosis were observed dur- 
ing early embryonic stages (Supplementary Figs 1 and 2). Histological 
evidence of reduced cBAT formation in Myf5-BMPR1A-KO mice was 
observed at embryonic day (E)16.5 and persisted until after birth 
(postnatal day (P)1; Fig. la-c). cBAT arises from highly proliferative 
fibroblasts during late gestational stages’’. At E16.5, developing cBAT 
stains strongly for the proliferation marker Ki67, which was markedly 
decreased in Myf5-BMPRIA-KO embryos (Fig. 1d and Supplemen- 
tary Fig. 2b, d). Apoptosis levels were unchanged in knockout mice 
compared to controls throughout embryogenesis (Supplementary 
Fig. 2c), suggesting that reduced proliferation occurring before or 
around E16.5 is responsible for the defective formation of cBAT in 
knockout animals. 


Myf5-BMPRIA-KO mice were born runted and stayed smaller 
throughout life (Fig. le, f). Importantly, the reduction of cBAT mass 
remained highly significant in adult mice (Fig. 1g, h). Despite this, 
the gene-expression pattern of the residual cBAT appeared normal, 
apart from a moderate reduction of Bmprla gene expression (Sup- 
plementary Fig. 3). The sizes of interscapular WAT (iWAT) and 
retroperitoneal WAT (rWAT)—two white-fat depots that contain 
subpopulations of cells from the Myf5~ lineage’*—were also reduced 
in the knockout mice (Supplementary Fig. 4a). Gene expression in 
iWAT showed a trend towards reduced expression of BAT genes, 
but there were no changes in the expression of general white adipo- 
genic genes (Supplementary Fig. 4b). Subcutaneous WAT (sWAT) 
and epididymal WAT (eWAT), both mostly originating from a 
Myf5~ lineage’*, were not decreased in size and expressed normal 
levels of all type-1 BMP receptors (Fig. 1g, h and Supplementary 
Fig. 3a—c). Expression of Bmpr1a in skeletal muscle, on the other hand, 
was reduced by >95% (Supplementary Fig. 3a). Upon normaliza- 
tion to body weight, we found that limb skeletal muscle size was 
unchanged, whereas the function of myogenic progenitors was altered 
(P.H. and E.G., unpublished data). Thus, loss of BMP signalling in 
Myf5-expressing cells specifically targets the formation of cBAT. 
During embryogenesis, MyoD™ progenitors emerge after the Myf5~ 
progenitors’’. MyoD-Cre-driven Bmprla knockout mice showed 
completely normal development of cBAT and WAT depots (Sup- 
plementary Fig. 5), suggesting that the developmental divergence 
between myogenic and brown adipogenic progenitors takes place 
before the emergence of MyoD* progenitors, or that BMPRIA is 
not required for cBAT formation during this particular developmental 
stage. 

A very similar phenotype was observed in a second mouse model 
with conditional deletion of Bmpr1a in all types of adipocytes (FABP4- 
BMPRI1A-KO). Loss of Bmprla led to a specific paucity of cBAT as 
well as WAT-resident rBAT (Supplementary Fig. 6). Because the 
FABP4-Cre driver is active at later adipogenic stages compared to 
Myf5-Cre, we conclude that signalling through BMPR1A is essential 
also for later stages of brown adipogenesis. 

Next, we isolated progenitors from either Myf5~ sWAT or Myf5~ 
cBAT to test their cell-autonomous ability to produce brown adipo- 
cytes’®. Cells derived from sWAT differentiated normally (Supplemen- 
tary Fig. 7). By contrast, the frequency and ability of cBAT-derived 
Myf5*/Scal*/CD31~ (Scal also known as Ly6a; CD31 also known as 
Pecam1) progenitors to differentiate into mature brown adipocytes was 
significantly reduced (Fig. 2a, b). We therefore generated brown pre- 
adipocytes from cBAT completely lacking BMPRIA (Supplementary 
Fig. 8a, b). Loss of Bmpr1a resulted in a marked inhibition of differenti- 
ation (Fig. 2c), and reduced ability to respond to BMP7-induced pho- 
sphorylation of SMAD and p38-mitogen-activated protein kinase 
(p38MAPK; also known as MAPK14)”’, which are downstream targets 
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Figure 1 | Loss of Bmpria impairs cBAT 
formation by decreasing embryonic progenitor 
proliferation. a, b, Haematoxylin and eosin 
staining of interscapular sections at embryonic 
stage E16.5 and in newborns (P1). Photographs 
(original magnification, 100) of the same 
anatomical location of transversal sections were 
acquired for control (CON) and Myf5-BMPRIA- 
KO mice (KO). Asterisks indicate spinal cord, 
orange lines indicate brown fat. c, Macroscopic 
images of cBAT of P1 newborns. d, Ki67 
immunofluorescence (green) of cBAT at E16.5 and 
co-localization with nuclear 4',6-diamidino-2- 
phenylindole (DAPI) stain (blue, left panels; 
analysis of Ki67* nuclei in Supplementary Fig. 1d), 
and light microscopy of the same area (right panels: 
original magnification, x 400). Orange lines 
indicate brown fat. e, f, Body weight analysis of 
control (C) and Myf5-BMPR1A-KO (K) mice from 
P1 newborns (e; control: n = 12; knockout: n = 4) 
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Figure 2 | Ablation of Bmprla in brown pre-adipocytes derived from the 
Myf5* lineage inhibits differentiation. a, Relative frequency of Scal"/ 
CD31 /CD11b /CD45~ adipogenic progenitors of the Myf5-Cre(YFP) * 
lineage in cBAT after normalization to tissue weight (n = 6). White bars 
indicate cells isolated from control mice (C), black bars indicate Myf5- 
BMPRI1A-KO cells (K). b, Quantitative polymerase chain reaction (qPCR) 
analysis of Ucp1, Cidea and Bmpr 1a messenger RNAs in fluorescence-activated 
cell sorting (FACS)-purified primary progenitors from cBAT after adipogenic 
differentiation (n = 3). Asterisks denote significant differences between 
genotypes: *P < 0.05. ¢, Triglyceride-specific Oil-Red-O staining of 
differentiated control (C) or BMPR1A-deficient (K) immortalized pre- 
adipocyte cell lines pretreated with vehicle (Veh) or BMP7 (B7). d, Western blot 
of phospho (p)-SMAD1/5/8, basal SMAD1, corresponding «-tubulin, 
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phospho-p38MAPK, basal p38MAPK, and corresponding o-tubulin in 
undifferentiated control (CON) and pre-adipocytes lacking BMPR1A (KO) 
following exposure to BMP7 for 10, 30 and 60 min. Representative blots from 
three independent experiments are shown. e, PCR of Zfp423, Cebpa and Pparg 
in undifferentiated brown pre-adipocytes after 24h exposure to BMP7. f, qPCR 
of Ucp1, Prdm16 and Pparg isolated from mature brown adipocytes. Pre- 
adipocytes were treated with BMP7 for 3 days, followed by 7-day 
differentiation. All experiments were performed in triplicate and are presented 
as mean + s.e.m. e, f, Statistically significant differences as determined by 
analysis of variance (ANOVA; P < 0.05) were: control (vehicle) versus control 
(BMP7) (*); KO (vehicle) versus KO (BMP7) (+); control (vehicle) versus KO 
(vehicle) (£); and control (BMP7) versus KO (BMP7) (8). 
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of BMP signalling (Fig. 2d). This led to a concomitant decrease in the 
expression of key adipogenic transcription factors zinc-finger pro- 
tein 423 (Zfp423)"', CCAAT/enhancer-binding protein « (Cebpa) and 
peroxisome-proliferator-activated receptor  (Pparg)”* in undifferenti- 
ated pre-adipocytes (Fig. 2e). After adipogenic differentiation, BMPR1A- 
knockout cells showed severely reduced expression of BAT markers 
Ucp1, PR-domain containing 16 (Prdm16)'* and Pparg (Fig. 2f), even 
in the presence of BMP7. Aside from BMPRIA, BMPRIB and 
activin A receptor, type 1 (ACVRI1) are the other two major type-1 
BMP receptors”. Whereas cBAT developed normally in whole-body 
BMPRI1B-knockout mice (Supplementary Fig. 9a, b), deletion of Acvr1 
in the Myf5* lineage resulted in a severe reduction of cBAT mass 
(Supplementary Fig. 9c, d), indicating that both ACVR1 and BMPRIA 
are essential for development of cBAT. Accordingly, pre-adipocytes lack- 
ing Acvr1 showed a somewhat milder phenotype compared to BMPR1A- 
deficient cells, whereas in double-knockout cells, brown adipogenesis was 
completely abolished (Supplementary Fig. 8c-g). 

As BAT has a key role in thermoregulation, one would anticipate 
that the Myf5-BMPR1A-KO mice would have reduced body temper- 
ature as a consequence of impairment in cBAT development. Indeed, 
newborn Myf5-BMPR1A-KO mice, with their unfavourable surface- 
to-volume ratio, showed a significant reduction in body temperature 
(Fig. 3a, b). This reduction in body temperature was, surprisingly, no 
longer present in adult Myf5-BMPR1A-KO mice maintained at nor- 
mal ambient temperatures of 22 °C (Fig. 3c, bars labelled 22 °C), indi- 
cating a compensatory mechanism that restored thermogenic capacity. 
Non-shivering thermogenesis is a critical response to prolonged cold 
exposure in order to maintain body temperature**. When exposed to 
acute and chronic cold challenges, control mice were able to quickly 
return to normal body temperature after 48h of cold exposure, sug- 
gesting rapid activation of non-shivering thermogenesis via cBAT, and 
possibly other short-term measures such as muscle shivering, which 
are less pronounced in newborn mice. By contrast, Myf5-BMPRIA- 
KO mice showed a reduction in body temperature after 2 h and 48 h of 
cold exposure at 5 °C, presumably due to the paucity of cBAT (Fig. 3c, 
bars labelled 5 °C/2 h and 5 °C/48 h), as we did not observe any abnor- 
mal behaviour, such as increased shivering, under cold exposure. 
Despite this, body temperature in Myf5-BMPRIA-KO mice returned 
to normal after prolonged cold exposure (that is, 11 days), strongly 
suggesting an adaptive recruitment of rBAT to cope with lower ambi- 
ent temperatures (Fig. 3c, bars labelled 5 °C/11 d). Accordingly, knock- 
out animals showed a marked increase in UCP1 protein expression in 
sWAT (Fig. 3d and Supplementary Fig. 10). This browning effect could 
be further enhanced in sWAT, and induced in eWAT, by administra- 
tion of the B3-adrenergic receptor agonist CL-316,243, as indicated by 
significantly increased expression of BAT markers Ucp1 and cell- 
death-inducing DFFA-like effector a (Cidea) in Myf5-BMPRIA-KO 
mice (Fig. 3e-h), as well as by increased emergence of multilocular 
UCP1* adipocytes in WAT (Fig. 3i, j). 

rBAT is sensitive to inductive factors, such as BMP7 (refs 10, 11), 
BMP8b”, fibroblast growth factor 21 (FGF21)* and the myokine 
irisin® (which is the processed and secreted form of the Fndc5 gene 
product), among others. However, gene expression analysis revealed 
no changes in any of these (Supplementary Fig. 11), suggesting that the 
compensatory browning is not mediated by these factors. Because 
thermogenesis is rigorously controlled by the sympathetic nervous 
system’, we quantified sympathetic input to white fat in Myf5- 
BMPRI1A-KO mice. Staining for tyrosine hydroxylase was significantly 
increased in sWAT of knockout mice (Fig. 3k, 1). Moreover, circulating 
levels of noradrenaline were also significantly elevated in knockout 
mice, suggesting that increased sympathetic input may contribute to 
the browning of WAT in Myf5-BMPRIA-KO mice (Fig. 3m). Addi- 
tionally, cold-exposed Myf5-BMPR1A-KO mice showed normal nor- 
adrenaline-induced thermogenic capacity (Fig. 3n and Supplementary 
Fig. 12), and thus possess a sufficient ability to compensate for loss of 
cBAT. These findings suggest that both types of brown fat may possess 
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Figure 3 | Specific ablation of cBAT results in a compensatory response of 
increased browning by enhanced sympathetic input to WAT. 
a, Photographic (left) and infrared (right) images of P4-P6 newborns. 
Knockout pups are indicated by white circles. b, Quantification of average skin 
surface temperature from infrared images. White bars indicate control mice 
(C), black bars indicate Myf5-BMPR1A-KO mice (K; control: n = 12; 
knockout: n = 10). ¢, Body core temperature of adult control and Myf5- 
BMPR1A-KO mice. Measurements were performed at room temperature 
(22 °C; control: n = 11; knockout: n = 10), and after 2h (n = 5), 48h (control: 
n= 7; knockout: n = 6) and 11 days (control: n = 7; knockout: n = 6) of cold 
exposure (5 °C). d, Western blot analysis of UCP1 in sWAT and eWAT. 
e, f, GPCR of Ucp1 gene expression in sWAT (e) and eWAT (f). Grey bars 
indicate controls after administration of §3-adrenergic agonist CL-316,243 
(1mgkg | body weight) for 10 days, red bars indicate CL-316,243-injected 
Myf5-BMPR1A-KO mice (n = 7; applies to all panels). g, h, qPCR of Cidea in 
sWAT (g) and eWAT (h). i, Haematoxylin and eosin staining showing 
morphology of sWAT and eWAT after administration of CL-316,243 (original 
magnification, <200). j, UCP1 immunofluorescence of sWAT and eWAT after 
administration of CL-316,243 (original magnification, < 400). Left panels show 
UCP1 immunofluorescence (green), right panels show light microscope image 
of the same area. k, 1, Tyrosine hydroxylase immunofluorescence and 
quantification of tyrosine-hydroxylase-positive nerve fibres in sWAT after 
administration of CL-316,243 (original magnification, *400) (n = 3). 
m, Serum levels of noradrenaline (n = 8). n, Time course of noradrenaline- 
induced oxygen consumption in mice maintained at 5 °C for 8 days (grey line, 
control; black line, knockout; n = 5). All data are presented as mean + s.e.m. 
Asterisks denote significant differences between genotypes: *P < 0.05; 
**P < 0.01; ***P < 0.001. 
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Figure 4 | Loss of sympathetic innervation causes atrophy of cBAT and 
compensatory browning of white fat. a, Tissue weight of cBAT, sWAT and 
eWAT normalized to body weight in sham-operated mice (white bars) and 
mice after surgical denervation of cBAT (black bars). b-d, qPCR of Ucp1 in 
cBAT (b), sWAT (c) and eWAT (d) of control mice, denervated mice, controls 
(grey bars) or denervated (red bars) mice injected with CL-316,243. S, sham- 
operated mice; D, surgically denervated mice. i.p., intraperitoneal. 

e, Haematoxylin and eosin staining of sWAT from sham-operated and cBAT- 
denervated mice after CL-316,243 injections. Scale bar, 100 um. All data are 
presented as mean + s.e.m. (n = 4). Asterisks denote significant differences 
between denervation and respective control groups: *P < 0.05 as determined 
by ANOVA. 


similar capacities for thermoregulation if maximally stimulated. 
Whereas cBAT is essential during acute cold challenges, compensatory 
rBAT in the knockout mice with severe paucity of cBAT has a criti- 
cal role in maintaining normal body temperature, especially during 
long-term cold exposure. In accordance with these findings, Myf5- 
BMPRIA-KO animals were resistant to diet-induced obesity, even 
under obesity-promoting thermoneutrality conditions, where mice 
no longer require thermogenesis to maintain body temperature’* (Sup- 
plementary Fig. 13). 

To determine whether the effect of compensatory browning is also 
present in other models of cBAT atrophy and independent of gene- 
tic intervention that could also affect skeletal muscle, we surgically 
interrupted innervation of cBAT in wild-type mice. Denervation of 
cBAT resulted in a significant decrease of cBAT size and a 68% reduc- 
tion of Ucpl expression in cBAT (P= 0.0029; Fig. 4a, b). As in 
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Myf5-BMPRIA-KO mice, atrophy of cBAT resulted in increased 
recruitment of brown adipocytes in WAT (Fig. 4c-e), thus reinforcing 
the notion of a systemic mechanism that regulates total BAT-mediated 
thermogenic capacity. 

It has been documented before that surgical removal of the inter- 
scapular depot of cBAT causes activation of the remaining depots of 
cBAT”. We used both genetic and surgically generated models of cBAT 
paucity to demonstrate the existence of a physiological mechanism to 
ensure thermoregulation by compensatory browning of WAT. This 
type of BAT may be more closely related to that found in adult humans’. 
The system inducing formation of rBAT seems to involve cBAT-brain 
and brain-WAT communication that is mediated, at least in part, by 
the sympathetic nervous system. Interestingly, obesity resistance in 
mice appears to be mostly related to browning of white fat, rather than 
adaptive thermogenesis of cBAT”, altogether suggesting that rBAT is a 
key contributor to metabolic health. The findings presented here sug- 
gest that any therapeutic approach involving rBAT must take into 
account the tight regulation of total-BAT-mediated thermogenic capa- 
city and systemic energy metabolism at both peripheral and possibly 
also central levels. Targeting these mechanisms, for instance by modi- 
fying BMP signalling to regulate BAT mass and activity, could be a 
feasible approach to develop obesity therapies. 


METHODS SUMMARY 


Animal models. For conditional knockouts, strains expressing Cre recombinase 
under the control of Myf5, Myod or fatty-acid-binding protein 4 (Fabp4) promo- 
ters were crossed to mice carrying floxed alleles for Bmprla or Acvr1 (Myf5-Cre 
only). For some experiments, Myf5-BMPR1A-KO mice were crossed to a Rosa26- 
YFP reporter strain. For Bmpr 1b, a strain with whole-body gene deletion was used. 
Denervation of cBAT in male, wild-type C57BL/6 mice was performed by isolating 
and removing a portion of about 1-2 mm from the intercostal nerve bundles 
innervating both lobes of the interscapular cBAT. 

Thermal imaging of skin-surface temperature. Measurement of skin temper- 
ature was performed using a thermal imaging camera (T300 InfraRed Camera; 
FLIR Systems) on newborn mice at ages P4—P6, before onset of hair growth. 
Statistical analysis. Results are expressed as mean + s.e.m. Differences between 
two groups were assessed using unpaired two-tailed Student’s t-test or U-Mann- 
Whitney test. Data involving more than two groups were assessed by ANOVA. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Animals. All animal procedures were approved by the Institutional Animal Use 
and Care Committee at Joslin Diabetes Center. Transgenic mice carrying floxed 
alleles for Acvr1 (ref. 31) or Bmprla (ref. 32) were used to generate conditional 
gene deletion mouse models by intercrossing with either Myf5- (ref. 33) or FABP4- 
driven** Cre expression as indicated. For studies involving Bmpr 1b deletion, ani- 
mals with whole-body gene deletion of this receptor were used**. Myf5-Cre- 
expressing animals were also crossed to Rosa26-YFP reporter mice (The 
Jackson Laboratory) in addition to the Bmpr1a-flox alleles for Myf5-lineage tra- 
cing and GFP labelling of Bmprla-deficient cells as described before'®. For geno- 
typing, DNA was isolated from tail tip biopsies by boiling in 0.5 ml 50 mM NaOH 
for 15 min, followed by neutralization by addition of 50 pil 1 M Tris-base (pH 6.8) 
and thorough vortexing. PCR genotyping was performed using primers as listed in 
Supplementary Table 1. Expected band sizes for Bmpr1a during gel analysis were 
180 bp for the wild-type allele and 230 bp for the floxed allele. For recombination 
analysis of Bmprla mRNA, expected band sizes were 396 and 233 bp for the intact 
and recombined alleles, respectively, and 178 bp for the control PCR of exons 6 to 
7. For Acvr1 genotyping, the PCR product was subsequently digested using the 
restriction enzyme Bgl-I (New England Biolabs) at 37°C overnight, yielding a 
band at 250 bp for wild-type mice, and a double band at 90 bp and 160bp for 
floxed alleles (all three bands in heterozygotes). Expected band sizes for Bmpr1b 
were 350 bp for the wild type, and 300 bp for the null allele. Expected band sizes for 
the Rosa26-YFP reporter mice were 600 bp for the wild-type allele and 320 bp for 
the mutant allele. For Myf5-Cre genotyping, bands at 600 and 400 bp for wild-type 
and mutant alleles, respectively, were expected. For a general Cre PCR, a single 
band at approximately 350 bp indicated presence of Cre cDNA in the genome. For 
loading control, IRS-1 primers were added for co-amplification in the same reac- 
tion (band at approximately 500 bp) to ensure proper loading with template DNA. 
To stimulate the browning of white adipose depots, mice were treated with daily 
intraperitoneal injections of 1 mgkg  * bodyweight CL-316,243 (Sigma-Aldrich) 
dissolved in PBS (also used for control injections) for up to 10 days. For experi- 
ments involving high-fat-diet feeding, 4- to 6-week-old animals were placed on a 
diet containing 45 or 60% kilocalories from fat (Research Diets). For cold exposure 
and thermoneutral conditions, animals were housed at 5 °C or 30 °C, respectively, 
for the indicated times in a controlled environmental chamber (Caron Products & 
Services) with free access to food and water. Body core temperature was deter- 
mined by rectal probe measurements. 

Haematoxylin and eosin staining. Sections were prepared, processed and stained 
as described". 

Immunofluorescence. Sections were deparaffinized and prepared as described'’. 
Primary antibodies were incubated overnight at 4 °C: Ki67 (1:200, rabbit polyclo- 
nal; Abcam); UCP1 (1:50, rabbit polyclonal; AnaSpec), tyrosine hydroxylase (1:50, 
rabbit polyclonal; Millipore), GFP (1:200, goat polyclonal; Novus USA). After 
primary antibody incubation, the sections were washed and incubated with appro- 
priate secondary antibody (Alexa Fluor-488 (green) or -594 (red); Invitrogen) at a 
1:200 dilution for 10 min in the dark. After secondary antibody incubation, sec- 
tions were washed with distilled water for DAPI staining (0.1 jg ml in water for 
5-10 min in the dark), and mounted. Sections were kept in the dark after mounting 
and analysed by confocal microscopy on a Zeiss LSM-410 Invert Laser Scan 
Microscope (Carl Zeiss Microlmaging, Thornwood, NY), or using a fluorescence 
microscope (Olympus BX60F-3; Olympus Corporation). Quantification of tyro- 
sine hydroxylase was performed by using the Image] software. Identical conditions 
and settings were used for picture acquisition and analysis. A threshold was set for 
each image to eliminate background and to create a binary mode image. A mini- 
mum particle size of 20 pixels was used as exclusion criteria to eliminate unspecific 
background and for quantification of areas that stained positive for tyrosine 
hydroxylase. For quantification of Ki67 staining, Ki67* nuclei were counted in 
areas identified as BAT by microscopic inspection of morphology and comparison 
to published sources*®, and normalized to the total number of DAPI" nuclei in the 
same area. For each section and animal, images from three representative areas 
were analysed. 

TdT-mediated dUTP nick end staining (TUNEL assay). For detection of DNA 
fragmentation, sections were deparaffinized and blocked for autofluorescence as 
described'®. Sections were blocked in 1% BSA, 0.5-1% Triton X-100 in PBS for 
1h, followed by washes in PBS. Terminal-strand labelling was performed for 1.5h 
at 37 °C in TdT buffer (30 mM Tris-HCl, 140 mM Na-Cacodylate, 1 mM cobalt (ID) 
chloride, pH 7.2) in the presence of dATP, biotinylated dUTP and terminal deox- 
ynucleotidyl transferase (all from Roche Applied Science). The reaction was stopped 
by immersion in 2X SSC buffer and subsequent incubation with Cy3-labelled strep- 
tavidin (1:200; Jackson ImmunoResearch Laboratories) for 1 h in the dark. Sections 
were then stained with DAPI, mounted, and analysed as described earlier. 

RNA and protein quantification. RNA extraction, cDNA synthesis, and quanti- 
tative real-time PCR (qPCR) were performed as described before’’. For qPCR 


analysis, C, values <30 were used for gene expression analysis. Protein detection 
by western blotting was performed as described before''. Primary antibodies were 
incubated overnight at 4°C: phospho-Smad-1/5/8 (1:1,000, rabbit polyclonal), 
SMAD1 (1:1,000, rabbit polyclonal), phospho-p38MAPK (1:1,000, rabbit poly- 
clonal), p38MAPK (1:1,000, rabbit polyclonal) (all from Cell Signaling 
Technologies), UCP1 (1:500, goat polyclonal; Santa Cruz Biotechnology) and 
a-tubulin (1:8,000, mouse monoclonal; Sigma-Aldrich). HRP-coupled secondary 
antibodies (Cell Signaling Technologies) were used at 1:2,000 dilutions at room 
temperature for 2 h followed by detection using the ECL system. 

Cell sorting. Scal* adipocyte progenitor cells were isolated from cBAT and 
sWAT of Myf5-BMPR1A-KO mice and control littermates, and differentiated 
as described before’®. 

Immortalized pre-adipocytes. Immortalized cell lines were generated as 
described previously*’. In brief, cCBAT from individual newborn pups (P1 or P2) 
of homozygous floxed parents for the respective BMP receptor was collected 
and pre-adipocytes were isolated by enzymatic digestion. Pre-adipocytes were 
immortalized by infection with SV40-expressing lentivirus and subsequent selec- 
tion with puromycin. Stable cell lines were then infected with adenovirus expres- 
sing either GFP (control) ora Cre::GFP fusion construct for in vitro recombination 
(Gene Transfer Vector Core, University of lowa). Forty-eight hours after infection, 
GEP* cells were collected by flow cytometry and expanded for subsequent use in 
experiments. DNA was isolated for PCR analysis to determine full recombination 
of the respective receptor gene using the primers as detailed earlier. 

Cell culture. Pre-adipocyte cell lines were cultured as described before"’, except 
that 2% fetal bovine serum was used during differentiation. BMP7 treatments 
(3.3 nM) were performed for 3 days before 48 h of adipogenic induction, followed 
by a differentiation period of 5 days. Oil-Red-O staining was performed as 
described before’. 

Thermal imaging of skin-surface temperature. Measurement of skin temper- 
ature was performed using a thermal imaging camera (T300 InfraRed Camera; 
FLIR Systems). Skin-surface temperature of newborns was analysed using FLIR 
Reporter 8.5 software (FLIR Systems). Images were acquired by placing newborn 
mice of the same litter (P4—P6) in 6-well cell-culture dishes, and 2-3 images of 
each mouse from different angles were acquired to minimize temperature varia- 
tions due to different postures of the animal. Software drawing tools were used to 
draw a region of interest (ROI) around the entire animal, and average body surface 
temperature was calculated using that ROI. For each animal, an average temper- 
ature value of the temperatures from all single images was calculated. These 
averages were then used for statistical analysis. 

Serum parameters. Serum levels of noradrenaline were determined using a com- 
mercially available ELISA kit and according to the manufacturer’s instructions 
(Norepinephrine (Research) ELISA; IBL America). To stabilize the noradrenaline, 
1mM EDTA and 4mM sodium metabisulphite were added to the serum. Serum 
was prepared by spinning freshly collected blood in a cooled centrifuge at 6,000g 
for 20 min. The clear supernatant was collected and stored at —80°C. Samples 
were analysed within 12 weeks after collection. To determine circulating irisin 
levels, a commercially available ELISA assay kit was used according to the manu- 
facturer’s specifications (Phoenix Pharmaceuticals). 

Body composition. Relative contents of lean and fat mass were determined using 
dual-energy X-ray absorptiometry (DEXA) according to the manufacturer’s 
instructions (GE Lunar PIXImus 2, General Electric Medical Systems). Animals 
were anaesthetized with pentobarbital (50 mgkg’ ', intraperitoneally) and placed 
in the scanning area to measure body composition. Relative lean and fat mass were 
calculated by normalizing to body weight. 

Noradrenaline-induced thermogenic capacity. Measurement of maximum 
thermogenic capacity was performed as described before with some modifica- 
tions'*’*. Animals were maintained in the cold as described earlier for 8 days 
before the experiment. All measurements were performed at room temperature. 
Oxygen consumption by indirect calorimetry was assessed using the Compre- 
hensive Lab Animal Monitoring System (CLAMS; Columbus Instruments) and 
the Oxymax for Windows software (version 4.58) for data analysis. Animals were 
anaesthetized with pentobarbital (80 mgkg ', intraperitoneally), and indirect 
calorimetry was performed immediately for 30min to record basal values of 
oxygen consumption. The sampling interval was set to 15 min to allow for stable 
oxygen consumption assessment throughout the experiment. After three data 
points were obtained, animals were briefly removed from the chamber and nor- 
adrenaline was injected subcutaneously (1 mg noradrenaline bitartrate per kg; 
Sigma-Aldrich), and oxygen consumption was recorded for another 90 min. To 
determine maximum noradrenaline-induced thermogenic capacity (AVO,), the 
average value for basal oxygen consumption before noradrenaline injection was 
calculated and subtracted from the average value of highest noradrenaline- 
induced oxygen consumption (fs and t7;). Areas under the curve (AUC) were 
calculated for the curves after noradrenaline injection. Data were presented either 
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not normalized, that is, per animal, or after normalization to total body weight, or 
total fat mass (from DEXA scan). 

Denervation of cBAT. Denervation of cBAT was performed as described before”*. 
C57BL/6J mice (The Jackson Laboratory) aged 6 weeks were used for denerva- 
tions. In brief, mice were anaesthetized and placed on a warm pad to maintain 
body temperature. Under a stereomicroscope, an incision was made posterior to 
the interscapular cBAT pad. Surrounding muscle and white fat was carefully 
moved to the side and the cBAT pad was turned upward to expose the five 
branches of the intercostal nerve bundles. Denervation was performed by isolating 
and cutting the nerve bundles and removing a portion of about 1-2 mm from each 
strand. Care was taken not to damage the adjacent blood vessels. The procedure 
was performed on left and right lobes of the interscapular brown fat. BAT pads, 
surrounding white fat, and muscle were placed in the original locations and the 
incision was closed using tissue adhesive glue (Vetbond, 3M Animal Care 
Products). The same procedure was performed for sham surgeries, except that 
nerve bundles were not cut. Animals were housed in single cages and monitored 
daily during the first week of recovery. After 10 weeks, animals were divided in 
four groups: (1) sham-operated mice receiving intraperitoneal vehicle injections 
(PBS); (2) sham-operated mice receiving injections of CL-316,243 as described 
earlier; (3) denervated mice receiving vehicle; or (4) CL-316,243. 

Statistical analysis. All statistical analyses were performed using the programs 
Excel (Microsoft), Statview (SAS Institute) and GraphPad Prism (GraphPad 
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Software). Statistical analyses were performed using two-tailed Student’s t-test 
or ANOVA for comparing the means of two or multiple groups, respectively. 
Nonparametric testing (U-Mann-Whitney test) was used where appropriate, that 
is, when normal distribution of sample sets was not evident. The means of two 
groups were considered significantly different when P < 0.05. 
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Natural RNA circles function as efficient 


microRNA sponges 


Thomas B. Hansen’, Trine I. Jensen!, Bettina H. Clausen’, Jesper B. Bramsen!*, Bente Finsen”, Christian K. Damgaard! 


& Jorgen Kjems'? 


MicroRNAs (miRNAs) are important post-transcriptional regu- 
lators of gene expression that act by direct base pairing to target 
sites within untranslated regions of messenger RNAs’. Recently, 
miRNA activity has been shown to be affected by the presence of 
miRNA sponge transcripts, the so-called competing endogenous 
RNA in humans and target mimicry in plants*’. We previously 
identified a highly expressed circular RNA (circRNA) in human 
and mouse brain*. Here we show that this circRNA acts as a miR-7 
sponge; we term this circular transcript ciRS-7 (circular RNA 
sponge for miR-7). ciRS-7 contains more than 70 selectively con- 
served miRNA target sites, and it is highly and widely associated 
with Argonaute (AGO) proteins in a miR-7-dependent manner. 
Although the circRNA is completely resistant to miRNA-mediated 
target destabilization, it strongly suppresses miR-7 activity, result- 
ing in increased levels of miR-7 targets. In the mouse brain, we 
observe overlapping co-expression of ciRS-7 and miR-7, particu- 
larly in neocortical and hippocampal neurons, suggesting a high 
degree of endogenous interaction. We further show that the testis- 
specific circRNA, sex-determining region Y (Sry)’, serves as a miR- 
138 sponge, suggesting that miRNA sponge effects achieved by 
circRNA formation are a general phenomenon. This study serves 
as the first, to our knowledge, functional analysis of a naturally 
expressed circRNA. 

To regulate mature miRNA activity, conceptual studies overexpres- 
sing miRNA target site concatamers, so-called miRNA sponges, were 
initially performed and shown to result in a loss of miRNA function 
accompanied with increased levels of endogenous targets’®. Sub- 
sequently, endogenously expressed linear RNAs have been shown 
to sequester and inhibit miRNA activity in plants (target mimicry)’ 
and mammals (competing endogenous RNA)*”’. Collectively, these 
data suggest that miRNA sponges are widespread regulators of 
miRNA activity in many eukaryotes. However, the miRNA sponges 
or miRNA-competing transcripts identified so far are expressed at 
low levels, contain a limited number of miRNA target sites, and, simi- 
lar to miRNA targets, are themselves subject to miRNA-mediated 
destabilization’. 

We recently identified a highly expressed endogenous circular RNA 
(circRNA) in human and mouse brain®, and demonstrated that the 
circRNA can be endonucleolytically cleaved by miR-671 in an AGO2- 
dependent manner; however, the function of this circRNA remained 
unknown. By searching for miRNA target sites in this circRNA, we 
identified 73 conventional seed-targets for miR-7. Plotting the target 
site position and nucleotide conservation, we observed that the target 
sites are selectively conserved compared to the adjacent regions in all 
eutherian mammals but not in marsupials (Fig. 1a). Although variable 
in size, this circRNA, ciRS-7, exists in the genome of all eutherian 
mammals investigated (Supplementary Fig. la). A computational 
motif analysis of human ciRS-7 using MEME” revealed a high preva- 
lence of miR-7 seed-targets with additional 3’-end base pairing; how- 
ever, they are mismatched at the central part of the duplex preventing 


miRNA-mediated endocleavage'*'* (Supplementary Fig. 1b). The miR- 
671 target site exhibits near-perfect complementarity and very little 
variation across species (Supplementary Fig. 1c), as well as high duplex 
stability compared to miR-7 (Supplementary Fig. 1d). Analysis of avail- 
able online AGO2 immunoprecipitation followed by high-throughput 
sequencing (HITS-CLIP) data from mouse brain“ revealed a high 
degree of AGO2 occupancy on the mouse variant of ciRS-7, with 
sequence reads dispersed throughout ciRS-7, and the highest read 
densities covering the region with the highest 8-nucleotide target site 
abundance (Supplementary Fig. 2a). Importantly, HITS-CLIP reads 
proximal to the nonlinear splice sites support the association between 
AGO2 and ciRS-7 (Supplementary Fig. 2b). This confirms our pre- 
vious observation showing a high expression of ciRS-7 in human and 
mouse brain® and other studies profiling brain-specific expression of 
miR-7 (ref. 15). 

To study the effect of ciRS-7 expression on miR-7 activity, we estab- 
lished a vector-based system expressing the circRNA. The biogenesis 
of circRNAs is at present unknown; however, previous studies have 
shown that exons flanked by inverted repeats induce circRNA forma- 
tion in vitro by nonlinear splicing"®, and this is thought to be essential 
for production of the circular Sry RNA*”’. Thus, we inserted the ciRS-7 
exon along with the endogenous flanking sequence into pcDNA3 
(pcDNA3-ciRS-7-ir; Fig. 1b). Subsequently, we copied part of the 
upstream flanking sequence and inserted it in an inverted orientation 
downstream (pcDNA3-ciRS-7; Fig. 1b). Transient expression of 
pcDNA3-ciRS-7, but not of the construct lacking the downstream 
inverted sequence (pcDNA3-ciRS-7-ir), resulted in the production 
ofacircRNA species that was indistinguishable from the endogenously 
expressed circRNA by northern analysis (Fig. 1c) and PCR with 
reverse transcription (RT-PCR; Supplementary Fig. 3a). In addition, 
we constructed a linear and polyadenylated miR-7 sponge containing 
all miR-7 sponge sites but excluding the splice sites and flanking 
sequences (ciRS-7-fs; Fig. 1b and Supplementary Fig. 4a). Enzymatic 
digestion of linear RNA by tobacco acid pyrophosphatase followed by 
either terminator exonuclease or RNase R cleavage confirmed a cir- 
cular structure of the RNA produced from ciRS-7 (Supplementary 
Fig. 3b, c). By co-expressing the circRNA- or the linear RNA- 
producing vectors with miR-7 or miR-769 expression vectors, ciRS-7 
levels remained unaffected, whereas the linear construct displayed 
miR-7 sensitivity and was lowered by more than 40 + 12% (mean + 
s.e.m.) across biological replicates (Fig. 1d and Supplementary Fig. 4b), 
presumably owing to miRNA-mediated activation of deadenylation 
and subsequent exonucleolytic degradation. This suggests that the 
target sites in ciRS-7 do not support endocleavage by miR-7, and that 
ciRS-7 is resistant to the conventional miRNA destabilization of 
mRNA and therefore not prone to miR-7-dependent regulation. 

To determine whether ciRS-7 serves as a binding platform for 
AGO2 and miR-7, we conducted Myc-AGO2 immunoprecipitation 
in HEK293 cells transiently co-expressing miR-7 or miR-769 as a 
control. By quantitative RT-PCR (qRT-PCR), endogenous ciRS-7 
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Figure 1 | Structure, conservation and heterologous expression of ciRS-7. 
a, Schematic representation of the genomic locus encoding human ciRS-7 and 
predicted miR-7 and miR-671 target sites are shown (grey-scaled according to 
target site strength). Mammalian conservation scores are depicted. The 
horizontal black bar represents the genomic location of ciRS-7. Magnification 
of a ciRS-7 sub-region is shown below. 8mer, 7mer-m8 and 7mer-1A targets 
denote conventional miRNA seed matches (specified in Supplementary Fig. 
la). b, ciRS-7 expression vectors are shown. ciRS-7-fs consists of the ciRS-7 
exon without flanking sequences and splice sites; ciRS-7-ir includes splice sites 
and the endogenous flanking genomic sequence (1 kilobase (kb) upstream and 
200 base pairs (bp) downstream). In ciRS-7, part of the upstream sequence has 
been inverted and inserted downstream, as illustrated by the directional bars. 


pull-down by AGO2 was specifically enriched more than 50-fold 
in miR-7-transfected cells compared to miR-769-transfected cells 
(Fig. 2a), strongly suggesting that miR-7 is facilitating AGO asso- 
ciation with ciRS-7. Furthermore, using a biotin-coupled miR-7 
mimic, we observed a more than sixfold enrichment of ciRS-7 in the 
miR-7-captured fraction compared to the negative control, biotiny- 
lated miR-138 (Fig. 2b). To strengthen the observed formation of 
a ternary complex between miR-7, AGO2 and ciRS-7, we devised a 
T7 transcript containing all putative miR-7 target sites in conjunc- 
tion with or without a streptavidin-aptamer motif'* (Supplementary 
Fig. 5a, top). As expected, by incubating the linear T7 transcripts with 
lysate from HEK293 cells expressing either miR-7 or control miR-769, 
AGO2 and miR-7 were efficiently extracted only from miR-7-containing 
cell lysate and in an aptamer-specific fashion (Supplementary Fig. 5a—c). 
This demonstrates that the miR-7 sites within ciRS-7 are able to facilitate 
a specific miR-7-AGO2 interaction in cell lysates. Taken together, we 
conclude that the widespread AGO occupancy is caused by miR-7- 
directed association of AGO2 proteins to the prevalent and conserved 
miR-7 target sites in the ciRS-7 RNA. 

To evaluate the subcellular localization of miR-7 and ciRS-7, we per- 
formed RNA- fluorescence in situ hybridization (FISH) in HEK293 
(Fig. 2c, d) and HeLa (Supplementary Fig. 6a, b) cells co-transfected 
with ciRS-7 and miR-7. This showed a large degree of overlap 
between ciRS-7 and miR-7 within distinct cellular bodies, suggesting 
co-localization. Immunofluorescence co-staining with the P-body- 
specific marker DCP1A (ref. 19) revealed that ciRS-7 co-localized with 
P-bodies; however, this was only observed when the circRNA was 
co-expressed with miR-7, suggesting that miR-7 compartmentalizes 
ciRS-7 to the bodies (Fig. 2d and Supplementary Fig. 6b). Endogenous 
ciRS-7 and miR-7 signals also overlapped in primary cells isolated 
from mouse brain, as determined by dual fluorophore RNA-FISH 
analysis (Supplementary Fig. 6c). 

To investigate whether miR-7 and ciRS-7 are globally co-expressed 
in the brain, we visualized their expression in the adult mouse brain 
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CMV, cytomegalovirus promoter; pA, polyadenylation signal; SA, splice 
acceptor; SD, splice donor. ¢, Northern blot with 5 1g of RNA from HEK293 
cells transfected with empty vector (EV; pcDNA3), ciRS-7-ir (pcDNA3-ciR7- 
ir), ciRS-7 (pcDNA3-ciRS-7) or total RNA from human brain. The blot was 
probed against ciRS-7 and 18S ribosomal RNA (loading control). Migration of 
circular ciRS-7 and vector-based non-circular product (asterisk) are marked. 
d, Northern blot with 10 ug (agarose northern, top two panels) or 20 ug (PAGE 
northern, bottom three panels) of RNA from HEK293 cells co-transfected with 
empty vector, ciRS-7-fs or ciRS-7, together with miR-7 or miR-769 expression 
vectors (pJEBB-7 or pJEBB-769, respectively). The blots were hybridized with 
probes as indicated. 18S rRNA and miR-15b serve as loading controls. 


using alkaline phosphatase-coupled miR-7 and ciRS-7 probes. The two 
RNAs showed a distinct neuronal expression (Fig. 2e) compared to 
mouse Gapdh mRNA (Supplementary Fig. 7a), comprising both 
cortical pyramidal neurons and interneurons (Fig. 2e; see legends to 
Fig. 2 and Supplementary Fig. 7 for further description of brain locali- 
zation). At higher magnification, it was clear that ciRS-7 exhibited 
dendritic translocation (Supplementary Fig. 7b). Using confocal visua- 
lization of dual fluorophore RNA-FISH (Supplementary Fig. 7c) or 
5-j1m thin adjacent brain slides (Fig. 2e, bottom, and Supplementary 
Fig. 7d, e), miR-7 and ciRS-7 displayed clear co-expression in cells. The 
high expression of ciRS-7 coincides with miR-7 expression in mouse 
brain sections and in primary cells isolated from mouse brain, which 
strongly suggests that miR-7 is interacting endogenously with ciRS-7. 

To test the effect of ciRS-7 expression on miR-7 activity, we con- 
structed two luciferase reporters to monitor miR-7 activity by inserting 
either a perfect miR-7 target (Fig. 3a) or the entire ciRS-7 sequence 
(Fig. 3b) into the 3’-untranslated region (UTR) of Renilla luciferase. 
Co-transfecting the luciferase reporter with miR-7 and ciRS-7 expres- 
sion vectors showed a substantial reduction of knockdown potential 
(ranging from two- to eightfold) when ciRS-7 was co-introduced with 
miR-7 (Fig. 3a, b). Comparably, the non-circRNA-producing vector 
ciRS-7-ir had little effect on knockdown; however, this may reflect low 
RNA production or inefficient circularization. Notably, the expression 
of ciRS-7 proved more efficient than a conventional anti-miRNA 
approach, suggesting that the ciRS-7 molecule is indeed a highly func- 
tional miRNA sponge. Similarly, using the 3’- UTR of human EGFR, 
which is a known target for miR-7, in a luciferase reporter assay, ciRS-7 
reduced the knockdown efficiency exerted by miR-7 effectively (Sup- 
plementary Fig. 8a). Next, we evaluated the effect of ciRS-7 expression 
on miR-7-dependent regulation of endogenous targets. To this end, a 
single copy of the ciRS-7 expression vector was stably inserted into 
HeLa Flp-In cells that otherwise have no or very little endogenous 
expression of ciRS-7. Analysing the expression levels of the con- 
structed HeLa cells by northern and qRT-PCR analysis revealed 
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Figure 2 | Interaction between ciRS-7 and miR-7. a, Immunoprecipitation of 
Myc-tagged AGO2 from HEK293 cells transfected with either Myc-AGO2 plus 
pJEBB-7 (miR-7) or with Myc-AGO2 plus pJEBB-769 (miR-769). Empty 
vector serves as a Myc-AGO2 negative control. ciRS-7 and human GAPDH 
mRNA levels were quantified by (RT-PCR, and the relative immunoprecipitate 
(IP)/input ratios are plotted. Error bars represent s.d. (n = 4). b, The 3’-end 
biotinylated miRNA duplexes were transfected into HEK293 cells. After 
streptavidin capture, the input and bound fractions were evaluated by qRT- 
PCR (analysed as ina). c, RNA-FISH on HEK293 cells co-transfected with ciRS- 
7 (pcDNA3-ciRS-7) and miR-7 (pCK-7). ciRS-7 and miR-7 probes were 
labelled with Cy5 and Cy3, respectively. Scale bar, 5 um. d, Combined 
immunofluorescence and RNA-FISH on ciRS-7-transfected HEK293 cells with 
or without miR-7 (pCK-7) co-transfection immunostained with human 


ciRS-7 levels comparable to the endogenous levels observed in 
HEK293 cells, although still approximately 100-fold less than the total 
brain levels (Supplementary Fig. 9a). 

As in HEK293 cells, the stably expressed ciRS-7 in HeLa cells was 
sensitive towards miR-671 as shown previously*, but remained resis- 
tant to miR-7 (Supplementary Fig. 9b-d). Transfecting miR-7 in a 
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DCP1A antibody and hybridized with Cy5-labelled ciRS-7 probe. Scale bars, 
5 tum. e, Mouse brains sections hybridized with probes specific for ciRS-7 (left) 
or miR-7 (right) showing intense neuronal expression throughout the 
neocortex (NCx), hippocampal subregions CA1 and CA3, and dentate gyrus 
(DG) (top and middle). Notably, miR-7 is also expressed in glial cells in the 
corpus callosum (cc, middle). Bottom panels show mouse brain in situ 
hybridization performed on apposing surfaces of 5-j1m thin adjacent sections. 
Arrows point to large neurons that based on anatomical landmarks are present 
in both hybridizations. g, granule cell layer; h, dentate hilus; Im, stratum 
lacunosum-moleculare; m, stratum moleculare; 0, stratum oriens CA1; p, 
pyramidal layer; r, stratum radiatum CA1. Scale bars, 200 1m (top), 100 ym 
(middle) and 50 ttm (bottom). 


dose-gradient manner revealed that established miR-7 targets (such as 
SNCA”, EGFR and IRS2 (ref. 22)) responded more efficiently in the 
empty cell line without ciRS-7 expression (HeLa-EV) compared to 
ciRS-7-expressing cells (HeLa-ciRS-7) (Fig. 3c-e). In addition, when 
transfecting miR-671 into HeLa-ciRS-7 before introducing dose- 
gradient miR-7 levels, the evaluated target genes significantly regained 
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Figure 3 | ciRS-7 acts as a sponge for miR-7 activity. a, b, Luciferase reporter 
assays using reporter constructs with either a perfect miR-7 target site (a) or the 
entire ciRS-7 sequence (b). HEK293 cells in 12-well dishes were transfected 
with 0.5 lig empty vector (pcDNA3), ciRS-7-ir (pcDNA3-ciR7-ir), ciRS-7 
(pcDNA3-ciRS-7), 50 nM anti-miR-7 (anti-7) or 50 nM anti-miRNA control 
(anti-Ctl), together with 0.2 ug miR-7 (pJEBB-7) or miR-769 (pJEBB-769) and 
0.1 ug psiCheck reporter. The relative levels of Renilla luminescence (RL) 
normalized to firefly luminescence are plotted. Error bars represent s.d. (n = 3). 
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c-e, (RT-PCR analysis of human SNCA mRNA (c), EGFR mRNA (d) and IRS2 
mRNA (e) levels in HeLa-ciRS-7 (light grey line) or HeLa-EV cells (dark grey 
line) transfected with increasing concentrations of miR-7 as indicated. 

f-h, qRT-PCR analysis of human SNCA (f), EGFR (g) and IRS2 (h) mRNA 
levels in HeLa-ciRS-7 cells transfected with miR-671 mimic short interfering 
RNA (miR-671; light grey line) or negative control (N.C.; dark grey line) siRNA 
24h before transfection with increasing concentrations of miR-7. Error bars 
represent s.d. (n = 4). *P < 0.05; **P<0.01. 
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miR-7 sensitivity compared to control transfections (Fig. 3f-h). This 
suggests that miR-671 probably functions as an indirect regulator of 
miR-7 activity by targeting and reducing ciRS-7 levels. In line with this, 
the sponge effect correlates with ciRS-7 expression levels (Supplemen- 
tary Fig. 8b), which after extrapolation indicates that the much higher 
expression of ciRS-7 in the brain exerts a stronger sponge efficiency 
here compared to the effect shown in HeLa-ciRS-7. 

The existence of a circRNA acting as a potent miRNA sponge 
immediately raises the question whether circular miRNA sponges 
are a more general phenomenon. Indeed, by predicting miRNA target 
sites in the other well-known circular, testis-specific Sry RNA, we 
identified 16 putative target sites for miR-138 (Fig. 4a). To study a 
direct interaction between miR-138 and Sry, we devised a vector 
expressing Sry circRNA (Supplementary Fig. 10a). Biogenesis of cir- 
cular Sry was verified by northern blotting and RT-PCR (Supplemen- 
tary Fig. 10b, c). To validate that Sry circRNA can act as a miR-138 
sponge, we inserted the Sry sequence into a luciferase reporter con- 
struct and tested the knockdown potential of miR-138 in the presence 
or absence of circular Sry RNA. Indeed, after co-expression of miR- 
138, we observed a fourfold reduction of relative luminescence com- 
pared to miR-769 expression, suggesting that the miR-138 target sites 
facilitate a miRNA effect (Fig. 4b). By co-expression of circular Sry 
RNA, the knockdown potential of miR-138 was significantly dimi- 
nished (Fig. 4b), however, presumably owing to modest circular 
RNA biogenesis from the Sry expression vector (10- to 20-fold 
below ciRS-7, data not shown), the sponge effect here was less pro- 
nounced. Immunoprecipitation of Myc-tagged AGO2 from miR-138- 
transfected cells resulted in a more than tenfold enrichment of Sry 
circRNA compared to miR-769-transfected cells (Fig. 4c). Further- 
more, using biotin-labelled miR-138, Sry circRNA was specifically 
captured (sixfold enrichment compared to biotin-labelled miR-7; 
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Figure 4 | Circular Sry RNA interacts with miR-138. a, Diagram of the 
mouse Sry locus indicating splice acceptor and splice donor sites facilitating Sry 
circularization. Perpendicular lines represent putative target sites for miR-138 
(target site code as in Fig. 1a). b, Luciferase reporter assays with the entire Sry 
sequence. Luminescence was measured 48 h after transfection with the 
luciferase reporter (psiCheck-Sry) and expression vectors for miR-138 
(pJEBB-138) or miR-769 (pJEBB-769) and with empty vector (pcDNA3; EV) 
or Sry expression vector (pcDNA3-Sry). The relative levels of Renilla 
luminescence normalized to firefly luminescence are plotted. Error bars 
represent s.d. (1 = 3). *P < 0.05. c, Immunoprecipitation of Myc-tagged AGO2 
from HEK293 cells co-transfected with the Sry expression vector, Myc-AGO2 
or empty vector and pJEBB-138 or pJEBB-769 as noted. Mouse Sry circRNA 
and human GAPDH levels were quantified by qRT-PCR and the relative 
immunoprecipitate/input ratios are plotted. Error bars represent s.d. (n = 4). 
d, As described in Fig. 2b, 3'-biotinylated miR-7 or miR-138 was transfected 
into HEK293 cells along with the Sry expression vector. Sry circRNA and 
GAPDH levels were analysed in the input and bound fractions by RT-PCR. 
Error bars represent s.d. (n = 4). 
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Fig. 4d), suggesting that Sry circRNA is able to interact with 
miR-138. Thus, we propose that the circular Sry RNA functions as a 
miR-138 sponge. 

Recently, circRNA was found to be a highly prevalent RNA species 
in the human transcriptome”. Our observation that very abundant 
endogenous circRNA molecules, which are inherently resistant to 
exonucleolytic RNA decay, can serve as efficient miRNA sponges, adds 
to the growing repertoire of regulatory functions in gene expression. 
Considering the widespread involvement of miR-7 as a key regulator 
of various cancer pathways”'** and the suggested implication of miR-7 
in Parkinson’s disease by direct targeting of «-synuclein protein 
expression’’, ciRS-7 probably serves as a crucial factor significantly 
engaged in the functioning of neurons as well as a responsible candi- 
date in neurological disorders and brain tumour development. 


METHODS SUMMARY 


Transfections were conducted using calcium phosphate procedure or Lipo- 
fectamine 2000 (Invitrogen). RNA was isolated using TRIzol (Invitrogen) accord- 
ing to manufacturer’s protocol. Northern blotting was conducted on 1.2% agarose 
or 12% PAGE, transferred to Hybond-N* membranes (GE Healthcare) and hybri- 
dized in Church buffer (see Methods). AGO immunoprecipitation was conducted 
as described previously”*. Biotin-labelled miRNAs were transfected into HEK293 
cells at a final concentration of 20nM and treated similar to AGO immunopre- 
cipitation. FISH was performed on paraformaldehyde-fixed cells transfected with 
constructs as indicated, and treated as described in Methods. Mouse brain sections 
(30 or 541m) were made from 8-9-week-old C57BL/6 male mice. Subsequent 
in situ hybridization was performed using custom-designed alkaline phospha- 
tase-labelled DNA and LNA probes. Luciferase assays were performed with the 
dual-luciferase reporter assay kit (Promega). Reverse transcription and quantita- 
tive PCR were conducted using MLV-RT (Invitrogen) and Platinum SYBR Green 
qPCR Supermix UDG (Invitrogen), respectively. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Constructs and transfections. HEK293 Flp-In T-Rex (Invitrogen) and HeLa 
Flp-In (Invitrogen) cells were maintained under standard culture conditions. 
Transfections were conducted with either calcium phosphate using standard 
procedures or Lipofectamine 2000 (Invitrogen) according to supplier’s protocol. 
The construction of pJEBB-769 has been published previously*. pJEBB-7 and 
pJEBB-138 were constructed similarly; PCR amplification of the miR-7-1 locus 
was performed using miR-7 forward/reverse primers, or the miR-138 locus 
(in mouse) using miR-138 forward/reverse primers, and the fragments were 
inserted into a NotI- and Sall-digested enhanced green fluorescent protein 
(eGFP) expression cassette (pJEBB). pCK-7 was generated by digesting pJEBB-7 
with NotI and Xbal and inserting the resulting fragment into Notl- and Xbal- 
digested pCK. 

pcDNA3-ciRS-7-ir was constructed by PCR amplifying the circRNA locus, 
including 1kb upstream and 200bp downstream to the nonlinear splice sites, 
using ciRS-7 forward/reverse primers. The PCR fragment was inserted into 
HindIII- and NotI-digested pcDNA3 (Invitrogen). An ~800-bp DNA stretch 
upstream of the splice acceptor was amplified using ciRS-7-ir forward/reverse 
primers and inserted downstream in the reverse orientation in an Xhol-digested 
pcDNA3-ciRS-7-ir, thus generating pcDNA3-ciRS-7. For the generation of 
pcDNA3-ciRS-fs, PCR amplification of the ciRS-7 exon using ciRS-7-fs forward/ 
reverse primers was inserted into pcDNA3 using HindIII and Xhol. To establish 
HeLa cells stably expressing ciRS-7, ciRS-7 was copy-pasted into pcDNA5 using 
HindIII and Apal, and transfected into HeLa Flp-In cells (Invitrogen) along with 
pOG44. Forty-eight hours after transfections, the cell media was supplemented 
with hygromycin B according to manufacturer’s protocol. 

pcDNA3-Sry was constructed by PCR amplifying the Sry locus from male 
mouse DNA using Sry forward/reverse primers, and inserting the amplicon into 
pcDNA3 using BamHI and NotI. Subsequently, Sall-digested CMV promoter 
from pcDNAS was inserted into the Xhol site in an inverse orientation. 

Luciferase reporter vectors were constructed by inserting annealed primers 
(miR-7-psiCheck forward/reverse) or PCR amplicons (produced by ciRS-7- 
psiCheck forward/reverse, Sry-psiCheck forward/reverse or EGFR-psiCheck for- 
ward/reverse) into the psiCheck-2 vector. 

Transfection of miRNA mimics (miR-7-5p and miR-671-5p, Applied Bio- 

systems) was conducted using Lipofectamine 2000 (Invitrogen). In case of dose- 
gradient transfections, a total concentration of 10 nM was used with varying ratios 
between miR-7 and negative control oligonucleotide, as indicated. 
Northern blotting. For miRNA, 20-30 ig whole cell RNA was loaded on a 12% 
denaturing PAGE gel. RNA was transferred to Amersham hybond-N+ mem- 
branes (GE Healthcare). The membranes were hybridized with [°*P]-labelled 
DNA oligonucleotides (listed in Supplementary Table 1) in Church buffer 
(0.5 M NaPO,, 7% SDS, 1mM EDTA, 1% BSA, pH_7.5) at 37 °C and washed in 
2X SSC (300 mM NaCl, 30 mM Na-citrate, pH 7.0) with 0.1% SDS at room tem- 
perature. The membranes were exposed on phosphorimager screens and analysed 
using Quantity One or Image Lab software (Bio Rad). 

For circRNA, agarose northern blot was performed with 5-10 ug RNA sepa- 
rated in 1.2% agarose. Subsequent hybridization and wash was carried out at 55 °C 
and 50 °C, respectively, otherwise conducted as described above. 

AGO immunoprecipitation. Myc-tagged AGO2 was co-transfected with either 
pJEBB-7 or pJEBB-769 in p10 dishes. Forty-eight hours after transfection, AGO 
immunoprecipitation was performed as described previously”. In brief, cells were 
lysed in 150 mM KCl, 25 mM Tris-HCl, pH 7.4, 5 mM EDTA, 0.5% Triton X-100 
and 5mM dithiothreitol (DTT) supplemented with Ribolock (Fermentas) and 
proteinase inhibitor cocktail (Roche). The lysate was mixed with Myc antibody 
(Abcam, ab9106) coupled sepharose beads (GE Healthcare) and left under rota- 
tion for 4h at 4°C. Beads were subsequently washed five times in lysis buffer and 
the RNA was extracted using TRIzol reagent (Invitrogen). 

Streptavidin aptamer. T7 transcription was performed on PCR products contain- 
ing the entire circRNA sequence with or without the inclusion of a streptavidin 
aptamer sequence using T7 MEGAscript (Ambion). The streptavidin-based 
capturing of RNA with associated proteins was conducted as previously pub- 
lished'*. In brief, the T7 transcript was renatured and mixed with streptavidin 
beads (Dynabeads MyOne Streptavidin C1, Invitrogen) for 1 h at 4 °C and washed 
twice in lysis buffer (10 mM HEPES, pH 7.0, 200mM NaCl, 1% Triton X-100, 
10 mM MgCl, and 1 mM DTT). Cells transfected 48 h before collection with either 
pJEBB-7 or pJEBB-769 were lysed and added to the RNA-coupled beads using half 
a confluent p10 dish per streptavidin capture. After rotating for 4h at 4 °C, the 
beads were washed five times and RNA and protein were extracted and analysed 
by northern blot (probes listed in Supplementary Table 1) and by western blot 
using anti-AGO2 (Abcam, ab57113) and anti-HuR (Santa Cruz, SC-5261) as a 
loading control. 
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Biotin-coupled miRNA capture. Biotin-labelled miRNAs were transfected into 
HEK293 cells at a final concentration of 20 nM. Forty-eight hours after transfec- 
tion, cells were collected and treated as described above. 

Immunofluorescence and RNA-FISH. Cells were seeded on glass cover- 
slips in 12-well plates and transfected with ciRS-7 and pCK-7 using calcium 
phosphate (HEK293 cells) or Lipofectamine 2000 (HeLa cells). Forty-eight 
hours after transfection, cells were washed in PBS and fixed in 4% parafor- 
maldehyde for 15min and permeabilized overnight in 70% ethanol. Cells 
were then washed twice in PBS containing 5 mM MgCl, (PBSM), and rehydrated 
for 10min in 50% formamide and 2X SSC. In case of immunofluorescence, 
cells were blocked with 3% BSA in PBSM for 1h followed by incubation 
with primary antibody (human DCPIA (ref. 26)) in 3%BSA/PBSM supple- 
mented with RNaseOUT (Fermentas) at 37°C. After washing three times in 
PBSM, cells were incubated with secondary antibody (Alexa Fluor 488, A21206, 
Invitrogen). Cells were then washed three times before proceeding with RNA- 
FISH. 

For FISH, cells were incubated at 37°C in a solution containing 50% forma- 
mide, 2X SSC, 0.25 mg ml! Escherichia coli transfer RNA, 0.25 mg ml! salmon 
sperm DNA (Invitrogen), 2.5 mgml ! BSA (Roche), and 0.5 ngml | fluores- 
cently labelled ciRS-7 (Supplementary Table 1) and miR-7 (Exiqon) probes. 
After 3h, cells were washed twice for 20 min at 37°C in 50% formamide and 
2X SSC, followed by four 5-min washes in PBS (the penultimate wash containing 
4',6-diamidino-2-phenylindole (DAPI)) and an additional brief wash in nuclease- 
free water. Cells were mounted in ProLong Gold (Invitrogen) and left overnight at 
room temperature. 

Mice for the in vivo study. This study was performed in accordance with guide- 
lines approved by the Danish Animal Ethical Committee (no. 2011/561-1950), 
using 8-9-week-old C57BL/6 male mice (n = 5) (Taconic). The mice were eutha- 
nized by cervical dislocation, the brains frozen in gaseous CO, and processed into 
30-um or 5-41m coronal cryostat sections. For paraformaldehyde-fixed tissue, 
brains were fixed and processed into 16-l1m sections for histological analysis as 
described previously”. 

In situ hybridization. In situ hybridization was performed as previously 
described** using custom designed alkaline phosphatase-labelled DNA and 
LNA probes (Supplementary Table 1) recognizing mouse Gapdh, ciRS-7 and 
miR-7 (DNA Technology A/S). Double fluorescence in situ hybridization was 
performed on fixed tissue. Sections were rinsed for 10 min in TBS buffer contain- 
ing 0.5% Triton X-100 (ref. 27) and incubated overnight with Maxima Probe/ 
master mix (Thermo Scientific) containing Cy3-labelled ciRS-7 and Cy5-labelled 
miR-7 probes. Development was arrested by rinsing the sections at 37°C in tap 
water and distilled water, before mounting with Aquatex (Merck). The specificity 
of the signal was tested by hybridizing parallel sections with: (1) 100-fold excess 
of unlabelled probe, (2) pre-treatment with RNase A (Pharmacia Biotech), (3) 
buffer alone, or (4) Cy5-labelled miR-449 probe. All controls were devoid 
of signal. 

Estimation of the percentage of ciRS-7- and miR-7-co-expressing neurons 
was performed in layer IV-V of the cingulate gyrus. ciRS-7- and miR-7- 
expressing neurons were counted on digitalized images of pairs of adjacent 
5-m thick sections. Per section, 75 randomly selected ciRS-7-expressing neurons 
were counted and analysed for co-expression of miR-7. Only ciRS-7-expressing 
neurons with a clearly identifiable nucleus were analysed. For control, a com- 
parable number of circular regions devoid of ciRS-7 was analysed for miR-7 
expression. 

Luciferase reporter assay. HEK293 cells seeded in 12-well plates and collected 
48h after transfection using the dual-luciferase reporter assay kit (Promega). In 
case of anti-miRNA experiments, anti-miRNA inhibitor (Applied Biosystems) 
was used at a final concentration of 50nM. Luminescence was measured on a 
BMG FLUOstar luminometer (BMG labtech). 

Tobacco acid pyrophosphatase exonuclease and RNase R treatment. In case 
of tobacco acid pyrophosphatase exonuclease, 4 ug of RNA was incubated with 
10 U of tobacco acid pyrophosphatase (Epicentre) at 37 °C for 1h. After incuba- 
tion, the RNA was ethanol precipitated and incubated with 1 U of Terminator 
5'-dependent exonuclease (Epicentre). In case of RNase R, 41g of RNA was 
incubated with 20 U of RNase R (Epicentre) at 30 °C for 10 min. In both cases, 
half the RNA was visualized by northern blotting, the other half was subjected to 
qRT-PCR as described below. 

qRT-PCR. Total RNA was extracted from cells in culture using TRIzol reagent 
(Invitrogen) according to standard procedures. Complementary DNA for qRT- 
PCR expression analyses was synthesized from total RNA by MLV-RT 
(Invitrogen) according to the supplied protocol using random hexamer primer. 
Quantitative PCR was performed with Platinum SYBR Green qPCR Supermix 
UDG (Invitrogen) on either MxPro3000 cycler (Stratagene) or LightCycler 480 
(Roche) according to standard procedures, using primers listed in Supplementary 
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Table 1. RNA levels were normalized to human GAPDH. In case of miRNA 
Taqman qPCR, miR-7, miR-769 and RNU48 assays (Applied Biosystems) were 
used adhering to manufacturer’s protocol. All quantitative PCR reactions were 
conducted in quadruplicates. 

In case of 3’-rapid amplification of CDNA ends (RACE), cDNA synthesis was 
primed by oligo-dTN. PCR was subsequently conducted using the 3’-RACE 
reverse/ciRS-7 (2) forward primer set. 

Statistical analysis. Statistical significance was determined by the standard two- 
tailed Student’s t-test. 
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Autophagosomes form at ER-mitochondria 


contact sites 


Maho Hamasaki!**, Nobumichi Furuta**, Atsushi Matsuda*’?, Akiko Nezu’?, Akitsugu Yamamoto®, Naonobu Fujita’, 
Hiroko Oomori’, Takeshi Noda!?, Tokuko Haraguchi*”, Yasushi Hiraoka’, Atsuo Amano*® & Tamotsu Yoshimori’? 


Autophagy is a tightly regulated intracellular bulk degradation/recyc- 
ling system that has fundamental roles in cellular homeostasis’. 
Autophagy is initiated by isolation membranes, which form and 
elongate as they engulf portions of the cytoplasm and organelles. 
Eventually isolation membranes close to form double membrane- 
bound autophagosomes and fuse with lysosomes to degrade their 
contents. The physiological role of autophagy has been determined 
since its discovery, but the origin of autophagosomal membranes has 
remained unclear. At present, there is much controversy about the 
organelle from which the membranes originate—the endoplasmic 
reticulum (ER), mitochondria and plasma membrane’”. Here we 
show that autophagosomes form at the ER-mitochondria contact site 
in mammalian cells. Imaging data reveal that the pre-autophagosome/ 
autophagosome marker ATG14 (also known as ATGI4L) reloca- 
lizes to the ER-mitochondria contact site after starvation, and the 
autophagosome-formation marker ATG5 also localizes at the site 
until formation is complete. Subcellular fractionation showed 
that ATG14 co-fractionates in the mitochondria-associated ER 
membrane*> fraction under starvation conditions. Disruption of 
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Figure 1 | ATGI4 assembles at the ER-mitochondria contact site. a, COS7 
cells co-transfected with GFP-ATG14 and DsRed-ER or RFP-SEC618 (an ER 
marker) were immunostained for the mitochondria marker TOMM20. Scale 
bars, 2 tm. b, An immunogold electron micrograph of starved HeLa cells 
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the ER-mitochondria contact site prevents the formation of ATG14 
puncta. The ER-resident SNARE protein syntaxin 17 (STX17) binds 
ATG14 and recruits it to the ER-mitochondria contact site. These 
results provide new insight into organelle biogenesis by demonstrat- 
ing that the ER-mitochondria contact site is important in autophago- 
some formation. 

ATG14 is a unique subunit of the autophagy-specific phosphatidyl- 
inositol-3-OH kinase (PI(3)K) complex involved in autophagosome 
formation®’. The ATG14 complex localizes to the isolation membrane, 
the cytoplasm and the ER. The anchoring of ATG14 to the ER is 
essential for autophagosome formation: under fed conditions, the 
ATGI14 complex seems to diffuse within the ER membrane; after 
induction of autophagy by starvation, the complex assembles at several 
specific points before autophagosome formation’. Three-dimensional 
electron tomographic studies have shown that the isolation membrane 
is ‘sandwiched’ between two ER cisternae, one of which is connected to 
the isolation membrane by a narrow extension*”. Taken together with 
a previous study showing that the ER-associated membrane known as 
the omegasome is the formation site’®, it is thought that the ER is the 
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expressing GFP-ATG14 probed with anti-GFP antibodies (red arrows). Scale 
bar, 200 nm. c, Immunoblots of subcellular fractions isolated from HEK293 
cells stably expressing GFP-DFCP1. Cyto, cytosol; micro, microsome; mito, 
mitochondria; PNS, post-nuclear supernatant. 
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Table 1 | Quantification of GFP-ATG14 localization 


Localization of ATG14 dots n=380 
Adjacent to or on both ER and mitochondria 201 (52.9%) 
Adjacent to or on ER only 177 (46.5%) 
Adjacent to or on mitochondria only 1(0.3%) 
On neither organelle 1(0.3%) 


ndenotes the number of ATG14 dots (described in Fig. 1a). 


primary platform for autophagosome formation. Because the ER and 
mitochondria contact each other, we examine the relationship between 
autophagosome formation sites and the ER-mitochondria contact 
sites by triple-colour imaging. We show that ATG14 assembled exclu- 
sively at the ER-mitochondria contact site under starvation conditions 
(Fig. 1a, Table 1 and Supplementary Fig. 2a). Using immunoelectron 
microscopy, we observed that green fluorescent protein (GFP)-tagged 
ATGI14 accumulated at ER-mitochondria contact sites under starved 
conditions (Fig. 1b and Supplementary Fig. 3a, b), and confirmed these 
observations in cells expressing Myc-ATG14 (Supplementary Fig. 3c, 
d). Our results support the idea that autophagosome formation takes 
place at the ER-mitochondria contact site during starvation. 

We recovered the mitochondria-associated ER membrane (MAM)’, 
which includes the contact sites, by Percoll centrifugation, identifying 
fractions using two MAM markers, the Ca**-binding chaperone 
calnexin (CNX) and fatty acid CoA ligase 4 (FACL4), as well as 
mitochondrial translocase of outer mitochondrial membrane 20 
(TOMM20)''". Under fed conditions, endogenous ATG14 was not 
present in the MAM fractions (Fig. 1c); under starved conditions, 
however, the localization of ATG14 shifted markedly to the MAM. 
The localization of other components of the PI(3)K complex shifted in 
a similar manner to that of ATG14 (Fig. 1c). After starvation, double 
FYVE domain-containing protein 1 (DFCP1) translocated to the ome- 
gasome, which serves as a platform for autophagosome formation”®. 
GFP-DFCP1 also shifted to the MAM under starved conditions 
(Fig. 1c). These results suggest that the ATG14 complex, as well as 
DFCP1, relocalizes to the MAM fractions during starvation. 

An additional marker, ATGS5, was used to observe autophagosome 
formation. ATGS binds only to the isolation membrane (which even- 
tually becomes an autophagosome) and detaches from the membrane 
after autophagosome formation completion” it is thus an ideal and 
specific marker for the initial stages of autophagosome formation. Like 
ATGI14, we found that 52.8% of ATG5 dots were observed at the 
ER-mitochondria contact site (Fig. 2a and Supplementary Fig. 2b, c). 


a b 


Starved 


YFP-ATGS 
RFP-SEC61B 


S 


n= 1,084 ATGS5 dots 


Anti-TOMM20 


n = 1,084 ATGS dots 


ER Mitochondria 


YFP-ATGS dots (%) 
os 
o 
YFP-ATGS dots (%) 


(0) 2 4 6 8 
Distance (4m) 


Distance (um) 


Figure 2 | Isolation membrane forms at ER-mitochondria contact sites. 

a, COS7 cells stably expressing yellow fluorescent protein (YFP)—-ATG5 were 
transfected with RFP-SEC618 and teal fluorescent protein (TFP)-mito, and 
immunostained for TOMM20. b, Time-lapse images of COS7 cells transfected with 
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Live-cell imaging was performed with three cameras to observe three 
markers simultaneously. Despite the fact that ATG5 is not an ER- 
resident protein, 98% of ATGS5 dots were found on the ER after star- 
vation (Fig. 2c). In fact, during the entire autophagosome formation 
process, most ATGS dots never separated from the ER-mitochondria 
contact site (Fig. 2b, Supplementary Fig. 4 and Supplementary Videos 
1-3). According to the live-cell imaging analysis, 98% of ATG5 dots 
localized on the ER and 79% were within 0.88 xm of the mitochondria 
(Fig. 2c, d). Localization near both organelles remained throughout the 
autophagosome formation process (Fig. 2e, f). However, unlike the 
stable association with the ER, association with mitochondria dynami- 
cally oscillated (but remained within 0.5 1m), which may explain why 
only 50% of ATGS5 dots were observed on mitochondria of fixed cells. 
These observations show that the ER is the platform of autophagosome 
formation, and dynamic mitochondria association may have a role in 
providing components required for this process. 

When the mitochondrial protein voltage-dependent anion channel 1 
(VDAC1)" was overexpressed, it localized at ER-mitochondria contact 
sites, indicating its viability as an ER-mitochondria contact-site marker 
(Supplementary Fig. 5a). In live-cell imaging, GFP-ATGS5 dots often 
appeared next to mCherry-VDACI1 dots and subsequently disappeared 
with time, whereas mCherry-VDAC1 dots remained, indicating the 
release of ATG5 from the ER-mitochondria contact site after comple- 
tion of the autophagosome formation (Supplementary Fig. 5b and 
Supplementary Video 4). These results strongly suggest that autopha- 
gosome formation takes place at the ER-mitochondria contact site. 

Phosphofurin acidic cluster sorting protein-2 (PACS-2), a cytosolic- 
sorting protein, has been implicated in the formation of ER- 
mitochondria contacts, and its knockdown uncouples mitochondria 
from the ER”. Puncta structures of GFP-ATG14 were barely visible in 
cells transfected with PACS2 short interfering RNA (siRNA), a similar 
result to the phenotype of cells deficient in mitofusin 2 (MFN2), which 
tethers the ER to mitochondria’®”” (Supplementary Fig. 6a). In starved 
PACS-2 knockdown and MFN2 knockdown cells, puncta structures of 
GFP-LC3, which represent the isolation membrane/autophagosome/ 
autolysosome’’, were attenuated (Supplementary Fig. 6b). Indeed, the 
knockdown of PACS-2 caused a decrease in the lipidated form of LC3- 
II, which binds to autophagic membranes, and inhibited its degrada- 
tion by autophagy (Supplementary Fig. 6c, d). Furthermore, siRNA 
knockdown of PACS-2 significantly decreased the levels of FACL4, 
ATG14 and GFP-DFCP1 in the MAM fraction, whereas only neg- 
ligible changes were observed in other fractions (Supplementary 
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YFP-ATG5, RFP-SEC618 and TFP-mito under starved conditions. 

c, d, Distribution of distances of YFP—ATGS5 dots from the centre to the surface of 
the ER (c) and to the surface of mitochondria (d). e, f, Distances of YFP—ATG5 dots 
over their lifetime for data from c (e) and d (f), respectively. All scale bars, 2 um. 
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Figure 3 | STX17 regulates the relocalization of ATG14 to MAM. a, COS7 
cells expressing GFP-SEC61f and STX17 were immunostained for TOMM20. 
b, HeLa cells expressing GFP-ATG14 were immunostained for STX17. 

c, Immunoblot of subcellular fractions of HEK293 cells. d, STX17 knockdown 
COS7 cells expressing GFP~ATG14 and RFP-SEC61 were immunostained 
for TOMM20. Error bars denote s.d. e, Immuoblot of subcellular fractions of 


Fig. 6e). These findings were also observed in MFN2 knockdown cells 
(Supplementary Fig. 6f), and suggest that the inhibition of the reloca- 
lization of ATG14 and DFCP1 to the ER-mitochondria contact site 
prevents the initiation of autophagosome formation. 

Along with ATG14 and DFCP1, another ER protein, STX17, was 
seen to relocate to the ER-mitochondria contact sites after starvation 
(Fig. 3a). We identified STX17 in a screen for ER-resident SNARE 
proteins involved in autophagy against invading group A Strepto- 
coccus (GAS)'°”° (Supplementary Fig. 7). STX17 is known to be a 
Qa-SNARE protein (a subfamily of Q (glutamate-containing)- 
SNARE proteins that occupy the same position as STX1A in the 
SNARE complex), but its function is not known’'. STX17 dots at 
ER-mitochondria contact sites were extensively colocalized with 
ATG14 under starvation conditions (Fig. 3b). Furthermore, STX17 
was abundant in microsome fractions under fed conditions, whereas 
it was mostly enriched in the MAM fraction of starved cells (Fig. 3c). 
Neither PACS-2 knockdown nor MFN2 knockdown had an effect on 
STX17 expression, but depletion of both inhibited the relocation of 
STX17 to the MAM fraction (Supplementary Fig. 6e, f). 

GFP-ATG14 dots did not locate at the ER-mitochondria contact 
site in STX17 knockdown cells under starved conditions, although 
they did colocalize with other part of the ER (Fig. 3d). Consistently, 
ATGI14 and GFP-DFCP1 were clearly decreased in the MAM fraction 
(Fig. 3e). By contrast, ATG14 depletion exhibited no effect on star- 
vation-induced relocation of STX17 to the ER-mitochondria contact 
site (Supplementary Fig. 8). These data indicate that STX17 acts 
upstream of ATG14, and is responsible for directing ATG14 to the 
ER-mitochondria contact site. Given the functional relationship 
between STX17 and ATG14, we performed co-immunoprecipitation 
experiments. Myc-STX17 did not significantly co-immunoprecipitate 
with GFP-DFCP1 under starved conditions, whereas it did pull 
down Flag-ATG14 (Fig. 3f and data not shown). Similarly, endogen- 
ous ATG14 as well as other components of the complex clearly 
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co-immunoprecipitated with Myc-STX17 under starved conditions, 
whereas ATG14 barely precipitated in the absence of Myc-STX17 
(Fig. 3g). These results suggest that Myc-STX17 associates with 
ATG14 only after starvation. 

Whereas STX17 knockdown hinders the recruitment of ATG14 to 
the ER-mitochondria contact site, ATG14 was still able to assemble at 
other locations (Fig. 3d), indicating that STX17 is necessary for the 
recruitment of ATG14 to the ER-mitochondria contact site but not for 
the assembly of ATG14 dots. LC3-positive puncta were formed and 
accumulated in STX17 knockdown cells, indicating that the autopha- 
gosome maturation step was arrested (Supplementary Fig. 10b). We 
examined LC3 puncta by correlative light and electron microscopy in 
STX17 knockdown cells infected by GAS, and found that LC3-positive 
puncta represent isolation membranes (Fig. 4a and Supplementary 
Fig. 9). Immunogold electron microscopy of GFP-LC3, as well as 
conventional electron microscopy, showed accumulation of isolation 
membranes in STX17 knockdown cell under starvation conditions 
(Fig. 4b, c). In control cells, autolysosomes (autophagosomes fused 
with lysosomes) but few isolation membranes were observed; by con- 
trast, in STX17 knockdown cells, isolation membranes were dominant 
whereas autolysosomes were nearly absent (Fig. 4c). Together, these 
data suggested that STX17 knockdown leads to an accumulation of 
isolation membranes deficient in autophagosome maturation. In addi- 
tion, GFP-ATG5 dots accumulated to a considerable degree in these 
cells even under fed conditions, similar to the phenotype of inhibited 
autophagosome completion in ATG4B mutants” (Fig. 4d, e). 
Consistent with this, knockdown of STX17 significantly decreased 
autophagy, as detected by several assays: (1) degradation of p62 (also 
known as SQSTM1)”* (Fig. 4f); (2) degradation of long-lived pro- 
teins”, the clearance of which depends on autophagy (Fig. 4g); (3) 
autophagic flux as measured by the amount of LC3 in the presence 
or absence of lysosomal protease inhibitors (Supplementary Fig. 10a); 
(4) fusion between autophagosomes and lysosomes, as measured by 
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Figure 4 | Functional autophagosomes do not form in STX17 knockdown 
cells. a, Correlative light and electron microscopy analysis of STX17 
knockdown HeLa cells expressing GFP-LC3 (white arrows) infected with GAS 
(red arrows). DAPI, 4’,6-diamidino-2-phenylindole. b, Electron micrographs 
of STX17 knockdown cells. White arrows denote autolysosomes, black arrows 
denote autophagosome, white arrowheads mark isolation membranes. 


monomeric red fluorescent protein (mRFP)-GFP tandem-tagged LC3 
(ref. 25) (Supplementary Fig. 10b); (5) formation of large auto- 
phagosomes engulfing GAS (Supplementary Fig. 9); and (6) autopha- 
gic killing of GAS (Supplementary Fig. 7c). These results indicate that 
knockdown of STX17 blocks the completion of the autophagosome 
formation. Therefore, it is likely that STX17-dependent recruitment of 
ATG14 to the MAM is essential for the formation of functional auto- 
phagosomes. Our most plausible explanation for the phenotypic dif- 
ferences between knockdown of STX17 and knockdown of PACS-2 or 
MEN2 is the existence of STX17. In PACS-2 or MFN2 knockdown 
cells, the disruption of the MAM allows STX17 to interact with ATG14 
(Supplementary Fig. 6g), blocking ATG14 dot formation. In STX17 
knockdown cells, the absence of STX17 frees ATG14 to initiate isola- 
tion membrane formation. 

Our findings provide new insights into two competing models for 
autophagosome biogenesis, that is, origin in the ER’ or the mito- 
chondria’’. Results that have exclusively supported either model can 
now be explained by a new model in which the autophagosome is 
formed at the ER-mitochondria contact site (Supplementary Fig. 1). 
In retrospect, it seems that research groups arguing in favour of either 
single organelle model have been observing the same phenomenon 
from different sides. Our model does not exclude a possibility that 
formation independent on the contact site could still occur. We have 
also discovered a novel role for the ER-mitochondria contact site, a 
special crossing-over point between two organelles that is involved in a 
range of physiological functions. Many questions still remain, but 
these findings open new avenues for further study of the molecular 
mechanisms of autophagy. 


METHODS SUMMARY 


All cell lines were cultured in DMEM (Wako) supplemented with 10% heat- 
inactivated FBS (Invitrogen). siRNA and siRNA negative control (Stealth RNAi) 
were purchased from Invitrogen. Cells were examined under a fluorescence laser 


392 | NATURE | VOL 495 | 21 MARCH 2013 


e Fed g 8 Bred 
Wi starved St (2 h) 
2 19 c St + 500 nM wor (2 h) 
Ss SS 6 
oO o = 
a 8 B55 
a os 
£2 6 os 4 
3 28 
6 4 28 
5 a2 
t 2 
f a P; 
. o WA Control STX17-1 STX17-2 
Sen SEEPS 
s cf siRNA 
S SS 
siRNA 


c, Immunogold electron micrographs of STX17 knockdown HeLa cells stained 
for GFP-LC3. d, GFP-ATG5 dots (arrows) in STX17 knockdown HeLa cells. 
e, Quantification of staining in d. f, Immunoblot of p62 in STX17 knockdown 
cells. Tubulin was used as a loading control. g, Degradation of long-lived 
proteins in STX17 knockdown cells. St, starved; wor, wortmannin. All error 
bars denote s.d. Scale bars, 1 [tm (a, b), 200 nm (c) and 5 ptm (d). 


scanning confocal microscope (LSM510; Carl Zeiss and FV1000; Olympus) using 
the Zeiss LSM Image Browser software (Carl Zeiss) and FLUOVIEW (Olympus). 
Live-cell imaging was performed with the Yokogawa spinning disc confocal system 
(CSU-X1; Olympus). Immunoblotting”, isolation of the MAM’, conventional elec- 
tron microscopy~, immunoelectron microscopy”, and the bulk protein degradation 
assay~’ were performed as described previously. All data are presented as mean + s.d. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Cell culture and transfection. All cell lines were cultured in DMEM (Wako) 
supplemented with 10% heat-inactivated FBS (Invitrogen). HeLa cells stably expres- 
sing enhanced GFP (eGFP)-LC3 and COS7 cells stably expressing YFP-ATG5 were 
constructed as previously described’*. HeLa cells stably expressing eGFP-ATG5 
were a gift from N. Fujita. Wild-type and ATG14-deficient mouse embryonic fibro- 
blasts were constructed as previously described’®. HEK293 cells stably expressing 
eGFP-DFCP1 were established as previously described'®. LipofectAMINE2000 
(Invitrogen) was used for transfection. For amino acid starvation, cells were cultured 
in EBSS (Sigma) without amino acids or FBS for 2h. Stable transformants were 
selected in complete medium containing 500 jg ml’ G418 (Sigma). 

Antibodies and immunoblotting. Antibodies against the following proteins were 
mouse monoclonal: Myc (MBL), GFP (Abcam), tubulin (Sigma), PDI (Stressgen) 
and CNX (BD Bioscience). Rabbit polyclonals include: LC3 (MBL), TOMM20 
(Abcam), FACL4 (Abcam), STX17 (Sigma), ATG14 (MBL), VPS34 (Cell 
Signaling), VPS15 (MBL), beclin (MBL), MFN2 (Sigma) and PACS-2 
(ProteinTech Group). Alexa Fluor-conjugated secondary antibodies (goat anti- 
mouse IgG and goat anti-Rabbit IgG) used for fluorescence microscopy were pur- 
chased from Invitrogen. Immunoblotting was performed as previously described”. 
siRNAs and plasmids. siRNA duplexes targeting STX17 (HSS123730 and 
HSS123732), STX18 (HSS122692 and HSS122693), SEC20 (also known as 
BNIP1) (HSS141385 and HSS141386), SEC22A (HSS120253 and HSS120254), 
SEC22C (HSS145024 and HSS145025), SLT1 (also known as USE1) (HSS125167 
and HSS125168), PACS2 (HSS146278 and HSS146279), ATG14 (HSS117763 and 
HSS117764), MFN2 (HSS115028 and HSS115029), and siRNA negative control 
(Stealth RNAi) were purchased from Invitrogen. siRNA for VTIIB was described 
previously’’. 

Expression vectors for mCherry-C1 plasmids’ and mRFP-GFP-LC3 (ref. 24) 
plasmids were constructed. GFP-ATG14 and Flag~ATG14 were also established’. 
mCherry- or GFP-Cyt b5-TM was constructed as described previously*®. GFP- 
SEC61B was a gift from T. Rapoport. DsRed-ER was a gift from K. Tabata. 
DsRed2-mito was obtained from Clontech. TFP-mito was constructed by repla- 
cing DsRed using PCR fragments. To construct Myc-STX17 plasmids, comple- 
mentary DNA cloned from genomic DNA of HeLa cells was inserted into 
pcDNA3.1/Myc-His(—)A (Invitrogen) using engineered EcoRI and Xhol sites. 
To construct Myc-STX17(ASNARE) plasmids, the STX17(ASNARE) domain 
was generated by PCR (primers: 5’-ATGTCTGAAGATGAAGAAAAAGTG-3' 
and 5'-TTAAGCCTTCCCTAAGTTTTTGG-3’). The PCR fragment was 
digested with EcoRI and Xhol and used to replace the STX17 cDNA in the 
Myc-STX17 plasmid. To construct Myc-ATG14 plasmids, cDNA from Flag- 
ATG14 was inserted into pcDNA3.1/Myc-His(—)A (Invitrogen) using engineered 
EcoRI and KpnI sites. To construct mCherry or GFP-~VDACI plasmids, cDNA 
from genomic DNA of HeLa cells was inserted into mCherry-Cl or EGFP-Cl 
(Invitrogen) using engineered Sall and BamHI sites. To construct mCherry or 
GFP-DRP1 plasmids, cDNA from genomic DNA of HeLa cells was inserted into 
mCherry-Cl or EGFP-Cl (Invitrogen) using engineered Sall and BamHI sites. To 
construct an mCherry-LC3 plasmid, PCR was performed to generate mCherry 
cDNA with exogenous restriction sites for BamHI and NotI at the 5’ and 3’ ends, 
respectively. The PCR fragment obtained after digestion with each restriction 
enzyme was inserted into pMRX-mCherry. LC3 cDNA cloned from the genomic 
DNA of HeLa cells was inserted into pMRX-mCherry using engineered EcoRI and 
Xhol sites. 

RT-PCR and real-time PCR. PCR with reverse transcription (RT-PCR) and 
real-time PCR were preformed as described previously”. The sequences of the 
oligonucleotides were as follows: STX17 (forward, 5'-GCCGTCTTGAACCAGCT 
ATC-3’; reverse, 5’-TTCATGCAACTTGTCCCAGA-3’), STX18 (forward, 5’- 
TCACATTGGCAAACTGAGAGAT-3'; reverse, 5'-TCTGGGCATCCTGGTC 
TATC-3'), SEC20 (forward, 5'-ACGTCCGGATCTGTAACCAA-3’; reverse, 5’- 
AAGGGTCCTGAACAATCACG-3’), SEC22A (forward, 5’-ATGCTTTCGAG 
GAAACTTGC-3’; reverse, 5'-GAGAAGGCGAGAACATTTGG-3’), SEC22C 
(forward, 5'-GAGTTTAGCCTTGCGACTGG-3’; reverse, 5'-AGGAGCAGAT 
AGCCATGCAG-3’), SLT1 (forward, 5'-GTCGAGGCTGGAGCTAAACC-3’; 
reverse, 5’-GGCCTGCAACATGTCCTCTA-3’), and GAPDH (forward, 5’- 
GTGGTCTCCTCTGACTTCAACAG-3’; reverse, 5’-CTGTAGCCAAATTCGT 
TGTCATAC-3'). The expression levels of messenger RNA are indicated as the 
relative cycle number for a given gene normalized against the cycle number for 
GAPDH. Each reaction was repeated at least three times to assess reproducibility. 
Fluorescence microscopy. Fluorescence microscopy was performed as described 
previously. To detect STX17, cells were permeabilized with 0.1% Triton X-100, 
and blocked with 1% normal goat serum (Abcam). 

Proteinase inhibitor. E64d and pepstatin A were purchased from the Peptide 
Institute. Cells were transfected with siRNA against STX17 or with control 
siRNA. After 48 h of incubation, the cells were cultured in EBSS containing E64d 


(10 pg ml ~ 1) and pepstatin A (30 pg ml ~ ‘) or ina solution lacking inhibitors (con- 
trol; dimethylsulphoxide 1.1 pg ml ') at the appropriate times. Lysates were exam- 
ined by western blotting using antibodies against LC3 and tubulin. 

GAS infection and bacterial viability assay. Infection with GAS (strain JRS4) 
and the colony-forming units (c.f-u.) viability assay were performed as described 
previously”®. 

Immunoprecipitation analysis. Immunoprecipitation analysis was performed 
using the c-Myc-tagged protein mild purification kit (MBL). Twenty-million cells 
were suspended in lysis buffer (20 mM Tris-HCl, pH 7.5, 150 mM NaCl, 5mM 
EDTA and 1% CHAPS). Cell lysates were centrifuged at 15,000g for 10 min. The 
resulting supernatant was transferred to a spin column and incubated with anti-c- 
Myc beads for 1h at 4°C. Anti-c-Myc beads in the spin column were incubated 
with elution peptide solution for 30s on ice and centrifuged for 10s. 

Electron microscopy and bulk protein degradation assay. Conventional elec- 
tron microscopy was performed as described previously*'. Immunoelectron 
microscopy was performed using a rabbit polyclonal antibody against GFP (a gift 
from N. Nakamura) by applying the pre-embedding gold enhancement method as 
described previously”. Samples were analysed with an electron microscope 
(H7600; Hitachi). Correlative light and electron microscopy was performed as 
described previously”, as was the long-lived proteins degradation assay”. In brief, 
degradation of long-lived proteins depends on the autophagic activity. Cells were 
seeded in 24-well dishes and incubated overnight. On the next day, the cells were 
exchanged into labelling medium containing ‘*C-valine (1.5 Ciml') and again 
incubated overnight. Cells were exchanged into chase medium (DMEM supple- 
mented with 10% FBS and 10 mM unlabelled valine) and further incubated for 4h 
to remove the contribution of short-lived proteins. After the chase period, media 
was replaced with either growth medium containing 10 mM valine or EBSS con- 
taining 10 mM valine (to induce autophagy). After 2 h incubation, the media were 
collected, and the trichloroacetic acid (TCA)-soluble fraction was analysed by 
scintillation counting. The cells were lysed in ice-cold RIPA buffer (25mM 
Tris-HCl, pH 7.5, 150 mM NaCl, 0.1% SDS, 1% Triton X-100, 1% deoxycholate, 
5mM EDTA and protease inhibitor cocktail (Roche)); the TCA-insoluble fraction 
was isolated and analysed by scintillation counting. To determine the rate of long- 
lived protein degradation, the count in the TCA-soluble fraction in the medium 
was divided by the total cellular count. 

Tokuyasu sampling and immunolabelling. Cells transfected with Myc-ATG14 
were pre-fixed for 10 min (4% paraformaldehyde/0.1% glutaraldehyde in PHEM 
(120 mM PIPES, 50 mM HEPES, 20 mM EGTA and 4mM MgCh, pH 6.9)) then 
fixed with fresh 4% paraformaldehyde in PHEM for 30 min at room temperature. 
Cells were scraped, resuspended in 10% gelatin/PBS, pelleted by centrifugation, 
and immediately placed on ice until the gelatin solidified. Gelatin-embedded 
samples were infiltrated with 2.3 M sucrose (in 0.1 M phosphate buffer), placed 
at 4°C overnight on a rotating wheel mounted onto sample pins, and frozen in 
liquid nitrogen. Subsequently, samples were cryo-sectioned at 60 nm thickness 
using an FC6/UC6 cryo-ultramicrotome (Leica), and selected with a 1:1 mixture of 
2% methylcellulose (0.025 Pa sts Sigma) and 2.3 M sucrose. After thawing, sec- 
tions were first reacted with anti-Myc antibodies (MBL) diluted 1:100, then with 
rabbit anti-mouse antibodies (Invitrogen) diluted 1:1,000, and finally with 10-nm 
gold-conjugated protein A following an established protocol”. For double label- 
ling, an anti-PDI antibody was used (Stressgen). Sections were embedded in 4% 
uranyl acetate/2% methylcellulose (ratio 1:9)** and viewed using a JEOL 1010 
electron microscope. 

Live-cell imaging. Live-cell imaging was performed at 37 °C using an Olympus 
IX81 microscope with a X 100 immersion objective lens (numerical aperture 1.40) 
equipped with a Yokogawa spinning disc confocal system (CSU-X1) and three 
EMCCD cameras (Andor iXon3). Images were acquired by Andor iQ live cell 
imaging software. 

Image analysis. Images were digitally processed as follows for subsequent statist- 
ical analyses. Bleed-through of the CFP channel in the YFP channel was removed 
by spectral unmixing. The background fluorescence in the YFP channel was then 
normalized using time-averaged images. Next, all channels were processed by a 
denoising algorithm to improve the signal-to-noise ratio in the fluorescence 
images™ and finally by constrained iterative deconvolution. The RFP channel is 
shown in logarithmic scale to visualize both the peripheral mesh structures and the 
thick network around the nucleus. The three-dimensional distance between the 
centre of a YFP-ATGS dot and the surface of the ER/mitochondria was measured 
with custom-made software. The mitochondria and ER had relatively constant 
fluorescence intensities and thus adaptive thresholding was effective to distinguish 
their surfaces. On the other hand, because YFP-ATGS dots have variable signal 
intensities, thresholding alone can change their size. Therefore, to reduce thresh- 
olding artefacts, centres rather than surface coordinates of the YFP-ATG5 dots 
were used. A surface of the ER/mitochondria complex was represented as the 
nearest pixel to the centre of a YFP-ATG5 dot. 
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Multidomain integration in the structure of the 
HNF-4a nuclear receptor complex 


Vikas Chandra’, Pengxiang Huang", Nalini Potluri', Dalei Wu', Youngchang Kim? & Fraydoon Rastinejad' 


The hepatocyte nuclear factor 4a (HNF-4q; also known as NR2A1) 
is a member of the nuclear receptor (NR) family of transcription 
factors, which have conserved DNA-binding domains and ligand- 
binding domains'”. HNF-4a is the most abundant DNA-binding 
protein in the liver, where some 40% of the actively transcribed 
genes have a HNF-4a@ response element’**. These regulated genes 
are largely involved in the hepatic gluconeogenic program and 
lipid metabolism***. In the pancreas HNF-4a@ is also a master 
regulator, controlling an estimated 11% of islet genes’. HNF-4a 
protein mutations are linked to maturity-onset diabetes of the 
young, type 1 (MODY1) and hyperinsulinaemic hypoglycaemia*"’. 
Previous structural analyses of NRs, although productive in elucid- 
ating the structure of individual domains, have lagged behind in 
revealing the connectivity patterns of NR domains. Here we des- 
cribe the 2.9 A crystal structure of the multidomain human HNF-40 
homodimer bound to its DNA response element and coactivator- 
derived peptides. A convergence zone connects multiple receptor 
domains in an asymmetric fashion, joining distinct elements from 
each monomer. An arginine target of PRMT1 methylation protru- 
des directly into this convergence zone and sustains its integrity. A 
serine target of protein kinase C is also responsible for maintaining 
domain-domain interactions. These post-translational modifica- 
tions lead to changes in DNA binding by communicating through 
the tightly connected surfaces of the quaternary fold. We find that 
some MODY1 mutations, positioned on the ligand-binding domain 
and hinge regions of the receptor, compromise DNA binding at a 
distance by communicating through the interjunctional surfaces of 
the complex. The overall domain representation of the HNF-4a 
homodimer is different from that of the PPAR-y-RXR-a heterodi- 
mer, even when both NR complexes are assembled on the same DNA 
element. Our findings suggest that unique quaternary folds and inter- 
domain connections in NRs could be exploited by small-molecule 
allosteric modulators that affect distal functions in these polypeptides. 

We previously reported the only high-resolution structural example 
of a multidomain NR complex, that of the PPAR-y-RXR-« hetero- 
dimer on its DNA response element’*. To understand the extent of 
domain integration in other NRs, here we analyse the crystal structure 
of the complex of HNF-4«, an obligate homodimer, bound to its DNA 
element and coactivator-derived peptides. HNF-4a uses the linear 
domain arrangement shown in Fig. la. Our efforts to crystallize full- 
length HNF-4« were unsuccessful. However, by proteolytically pro- 
bing its DNA-assembled complex, we identified an extended segment 
comprising the DNA-binding domain (DBD)-hinge-ligand-binding 
domain (LBD) portions and corresponding to residues 46-368 (Sup- 
plementary Fig. 1). Cloning, expression and purification of the sta- 
ble DBD-hinge-LBD multidomain segment made it possible to 
obtain well-diffracting crystals of a complex with its consensus res- 
ponse element and coactivator (NCOA2) peptide. Electron density 
maps for all the critical junctions of the complex are shown in 
Supplementary Figs 2-5. The response element consists of a direct 
repeat of AGGTCA half-sites with one base-pair spacing (DR1). The 


DR1 is the major consensus binding site for both HNF-4a and PPAR- 
y-RXR-o**9, 

X-ray diffraction data was collected to 29A resolution, and the 
structure refined (see Supplementary Table 1). The crystal asymme- 
tric unit contains two independent representations of the HNF-4a- 
homodimer-DNA-peptide complex. The electron density map from 
one complex and the comparison of the two complexes is shown in 
Fig. 1b, c. The two representations are nearly identical, with root mean 
squared deviations of less than 2.0 A over all their atoms. The LBD and 
DBD portions match their previously determined isolated structures 
(Supplementary Figs 6 and 7). Both DBDs are in register with their 
half-sites, interacting with the major grooves (Supplementary Figs 8 
and 9). Helix 12 of the LBDs is in the active conformation, and a 
coactivator LXXLL peptide is bound to each LBD. 

The HNF-4a% homodimer shows a striking and complex pattern of 
interfacial junctions. A central zone incorporates surfaces from both 
LBDs, the DBD of the upstream subunit, and the hinge region of the 
downstream subunit. This domain convergence zone suggests a path 
of communication between the conserved domains through their 
coupled surfaces (Fig. 2a). The LBDs, symmetrical in their mutual 
interactions when viewed in isolation, cooperate in a highly asymmet- 
ric fashion to straddle the surface of only the upstream DBD (Fig. 1d). 
As a result, the overall complex appears partitioned towards the 
upstream half of the DR1, and adopts a highly asymmetrical organi- 
zation for a homodimeric transcription factor. A previous study sug- 
gested that HNF-40 homodimers could bind asymmetrically to their 
DNA response elements®. The resulting quaternary arrangement cre- 
ates precisely the correct DBD-to-DBD distances needed to match the 
geometric constraints of the two AGGTCA half-sites and their inter- 
vening spacer. At the same time, the quaternary organization renders 
both LBD pockets and their coactivator-interacting surfaces unen- 
cumbered, allowing free access to both ligands and LLXXLL elements, 
respectively. 

The interface that forms between the DBD of the upstream subunit 
and the hinge region of the downstream subunit is one important 
domain-domain interface of the complex, and is reminiscent of an 
interaction we described previously in the PPAR-y-RXR-« complex”. 
The resulting arrangement places the two DBDs in a solid head-to-tail 
arrangement that extends their combined footprint to match their 
DRI contact surface perfectly. The manner in which the two LBDs 
cooperate to interact with the upstream DBD suggests that the physical 
integration of all three domains may be required for high-affinity DNA 
binding (Fig. 1d). To test this idea, we measured the DNA affinity of 
the HNF-4a protein when various segments of the protein were 
removed (Fig. 1d.) Measuring first the DNA-binding affinity of the 
HNF-4« that only contains its DBD and hinge portions, we observed 
very weak binding to DRI with an equilibrium dissociation constant 
(Kg) of approximately 6,000 nM. When the LBD portion of the recep- 
tor is contained within the polypeptide, the complex showed a 75-fold 
enhanced affinity for DR1, with a Kg of approximately 80 nM (see 
Fig. le and Supplementary Table 2). These results are consistent with 
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Figure 1 | Overall organization of the HNF-4a homodimer on DNA. 

a, Linear depiction of the HNF-4a protein domains. b, Electron density map 
(2F, — F,) for one of the two HNF-40-homodimer-DNA complexes of the 
crystallographic asymmetric unit. c, Superposition of the two independent 
HNF-4a-homodimer-DNA complexes in the asymmetric unit. The two 
homodimeric complexes are coloured orange/yellow in one case, and blue/ 
purple in the other. The ligands in the LBD are shown in blue, Zn is shown in 
grey, and the coactivator peptides are shown in red/brown. Numbers with 
prime refer to the upstream-positioned subunit. d, The positioning of both 
LBDs in a complex on top of the upstream DBD enables high-affinity DR1 
DNA binding. e, Contribution of various receptor domains to the DNA- 
binding affinity of HNF-4%. DNA binding is measured using fluorescent 
polarization studies with a 5’ fluorescein isothiocyanate (FITC)-labelled DR1. 
The x-axis shows concentration of the HNF-4« protein, and the y-axis shows 
the fractional DNA bound. Removal of the AB domain (AAB), F domain (AF) 
or both (AABAF) does not alter the DNA affinity compared to the full-length 
(FL) receptor. However, removal of the LBD reduces the affinity of the resulting 
DBD-hinge (DBD) region to a Kg of approximately 6,000 nM, whereas the 
presence of the LBD together with the DBD hinge (AABAF) allows DNA 
binding with a Kg of 82 nM (see also Supplementary Table 2). 


our observation that the LBD and DBD modules are physically and 
functionally integrated to establish high-affinity DNA binding. They 
are also consistent with a previous study that showed that the LBD 
significantly enhances the half-life of the HNF-4«0 DNA-binding 
complex™. 

We next measured the DNA-binding contributions of the amino 
(N)-terminal (AB region) and carboxy (C)-terminal (F region) por- 
tions of the polypeptide, both of which were removed from our crys- 
tallization construct (Fig. 1a). We found little contribution from these 
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LBD-LBD 


LBD-LBD 


Figure 2 | Domain-domain contacts of HNF-4a. a, Circle indicates the 
convergence centre, where four domains (both LBDs, the upstream DBD and 
the downstream hinge) come together. b, Arg 91, located on the surface of the 
DBD, inserts deeply into a pocket at the base of the LBD-LBD surface. 
Numbers with prime refer to the upstream-positioned subunit. c, Ser 78, 
positioned on the back side of the DNA recognition helix, also enables the 
convergence of the DBD and the LBDs. 


segments to the overall affinity of the complex for DNA, when exa- 
mined individually or in combination (Fig. 1d). The proteolytically 
sensitive nature of these regions, even in the DNA-binding complex of 
HNF-4«, also suggests that they are poorly ordered and not involved in 
DNA binding (Supplementary Fig. la-c). We also prepared the iso- 
lated AB and F domain fragments of HNF-40 and tested their ability to 
bind to the rest of the homodimer-DNA complex. However, we 
detected no appreciable binding of these receptor portions with the 
rest of the complex (Supplementary Fig. 1d). 

Both HNF-4« LBDs have electron density profiles consistent with a 
trapped fatty acid, with the size indicating a myristic acid derived from 
Escherichia coli, in which the protein was expressed (Supplementary 
Fig. 2)'°'°. The fatty acid is believed to lend structural integrity to the 
HNF-40/y subfamily. Linoleic acid has been shown to be an exchange- 
able and potentially endogenous ligand of HNF-4a, although this 
molecule does not confer transcriptional activity’’. A stabilizing fatty 
acid, or a silent molecule that cannot switch receptor activity on and 
off, raises the question of how HNF-4« activity is otherwise regulated. 

The activities of NRs can be regulated by a variety of post-translational 
modifications (PTMs)'*. In the case of HNF-40, two PTMs are well 
described for their ability to regulate receptor properties’*”°. These 
modifications control the receptor’s ability to bind DNA, and by exten- 
sion its ability to regulate gene expression. We identified the quaternary 
sites of these PTMs within HNF-4«. The first site, Arg 91, is a target of 
PRMT1, an enzyme that adds up to two methyl groups to the arginine 
side chain. Arg 91 methylation produces a marked enhancement in the 
DNA-binding activity of HNF-4c'°. The second site, Ser 78, is phos- 
phorylated by protein kinase C (PKC), which disrupts the ability of 
HNF-4@ to bind DNA”. Therefore, taken together, these two PTMs 
act as on and off switches for regulating receptor activity. 

Arg 91 methylation substantially enhances DNA affinity, but is not 
positioned to influence DNA binding directly from its location on the 
DBD farthest from the DNA. Figure 2b shows how its side chain deeply 
protrudes into the LBD-LBD cooperating surface that we described 
earlier as the receptor’s multidomain convergence zone. There is a 
cavity directly above the side chain of Arg91 to accommodate the 
two extra methyl groups, and the extension of the side chain through 
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methylation would more firmly ‘glue’ the DBD junctional interface 
with both LBDs. Therefore, this PTM acts to bias allosterically the 
receptor to bind DNA, by stabilizing the interdomain junctions associ- 
ated with the final productive DNA complex. 

We next analysed the location of Ser 78, the site of PKC phosphor- 
ylation in a number of NRs”’. Along with HNF-4a, other NRs includ- 
ing FXR (also known as NR1H4), RAR-« (also known as NRIB1), 
VDR (also known as NRII1), PPAR-«& (also known as NR1C1), PXR 
(also known as NR1I2) and TR2 (also known as NR2C1) are similarly 
targeted by PKC, which in each case phosphorylates a similarly posi- 
tioned serine on the DBD. Curiously, this serine always resides on 
the ‘wrong side’ of the DNA recognition helix, as is the case in 
HNF-4a, where it seemingly cannot participate directly in DNA bind- 
ing (Fig. 2c)*°*". Yet Ser78 phosphorylation nevertheless weakens 
receptor-DNA binding substantially”. Our structure indicates that 
Ser 78 is positioned to engage the receptor’s interfacial connections 
so as to reduce DNA binding allosterically. Figure 2c suggests how 
an added phosphate group on this residue would create clash, both in 
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Figure 3 | Disease-linked mutations in HNF-4a. a, Summary of MODY1 and 
hyperinsulinaemic hypoglycaemia point mutations identified clinically in 
human populations. b, c, The MODY1 mutations, in many cases (residues in 
red) map to the convergence centre of the receptor domains (blue circle in 
b). d, DNA-affinity measurements of the wild-type (WT) and mutant 
receptors, as described in Fig. le. See also Supplementary Fig. 10 for studies of 
other disease-linked mutations. 
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size and charge, with nearby Tyr 319, a residue that physically connects 
the receptor LBD with the DBD through Ser 78. Phosphorylation 
would compromise the integrity of the quaternary fold needed for 
efficient DNA binding. Allosteric mechanisms of this type cannot be 
understood using the isolated crystal structures of DBDs or LBDs 
alone, as both Arg91 and Ser 78 would appear to be too far from 
DNA-binding surface from that analysis. The current analysis of the 
domain organization, however, shows the unique positioning of these 
residues being consistent with their ability to affect the receptor’s 
DNA-binding function allosterically. 

We next asked whether some point mutations linked to MODY1 
and hyperinsulinaemic hypoglycaemia are similarly positioned in 
sensitive interjunctional surfaces (Fig. 3a). For R76W and R80W 
mutations (associated with hyperinsulinaemic hypoglycaemia), there 
is a simple explanation for receptor dysfunction, as this pair of arginine 
residues directly contacts the AGGTCA half-sites (Supplementary Figs 
9 and 10). V255M alters a residue that points into the LBD pocket, the 
only residue doing so among all the mutations associated with 
MODY 1 and hyperinsulinaemic hypoglycaemia mutations. We found 
a number of mutations that lie at the sensitive domain-domain junc- 
tions of the complex. Sites such as R127W, D126Y, D126H and 
R125W locate to the downstream hinge region where they form 
domain-domain arrangements with the upstream DBD (Supplemen- 
tary Figs 4 and 10). Mutational changes in this hinge site would mis- 
align the interaction between domain-domain surfaces required to 
bridge the two DBDs into register with their successive AGGTCA 
half-sites. Indeed, we find that these mutant proteins substantially 
compromised DNA affinity (Supplementary Fig. 10). This loss of 
DNA binding also translates to a reduction in transcriptional activity”. 
We next examined MODY1 mutations 1314F and R324H, and their 
adjacent residues (R322A, Q318A, D316A and N315A), which were 
found to be on the LBD and at the multidomain convergence centre of 
the complex (Fig. 3b-d). These mutations reduced the DNA affinity 
and transcriptional activity of the receptor (Fig. 3d and Supplementary 
Fig. 11). 

Our investigation of PTMs and MODY1 mutations shows that 
changes introduced in the LBD, the hinge region, or in the DBD away 
from the DNA interface, still affect the DNA-binding properties of the 
receptor at a distance, by communicating through the interdomain 
junctions of the quaternary fold. It is interesting to note the subtlety 
of a single PTM or a single amino-acid mutational change, and the 
large distance with which these signals travel across the polypeptide to 
modulate DNA binding. Therefore, the domain convergence centre 
should be seen as both a sensitive centre for receiving signals, and an 
allosteric transmission system for propagating signals. At the same 
time, it is important to note that the two subunits of the homodimer 
are in altogether different environments owing to the asymmetric 
nature of the two subunits. PT'M sites such as Ser78 or Arg91 will 
have a large influence on the complex only if they occur in the 
upstream DBD. In the same way, some MODY1 mutations would 
appear to be damaging if located in one, but not the other, subunit 
of the homodimer. Owing to the considerable numbers of genes being 
actively controlled by HNF-4« in the liver and pancreas, the loss of 
even a fractional population of functional homodimers caused by 
heterozygous mutations can cause disease. 

As both the HNF-4% homodimer and PPAR-y-RXR-« complexes 
target DRI, we asked if their quaternary architectures were related. The 
common DR1 is expected to establish a similar DBD-DBD spacing in 
these complexes. Figure 4a, b shows the PPAR-y-RXR-« heterodimer 
and the HNF-4« homodimer in an identical way, based on the layout 
of their common DRI sequences. Figure 4c shows the superposition of 
these complexes when their DR1 sequences are aligned to match. 
Indeed, the DBDs occupy nearly identical positions in both DR1 com- 
plexes. Nevertheless, the higher-order quaternary arrangements are 
distinct for these two complexes (Fig. 4). In HNF-40, the LBDs are 
biased towards the upstream DBD, whereas in the PPAR-y—RXR-o 
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complex the LBDs are biased towards the downstream RXR-« DBD. 
Moreover, the PPAR-y-RXR-« complex has its own type of domain 
convergence centre, which is not identical to that in the HNF-4a 
complex. 

The structural comparison indicates that the DNA response ele- 
ment type is not the only driver of quaternary structure in NRs. 
Receptor organization seems to be highly dependent on the constel- 
lation of non-conserved amino acids on these LBD surfaces, and on the 
length and sequence of the hinge segments, which are unique to NR 
members. We also note that DNA recognition is not identical in 
these two complexes. PPAR-y uses its hinge region to recognize an 
additional six base-pair segments located upstream to the DRI core 
element, establishing the polarity of subunits in that heterodimer. 
HNF-4« subunits do not use their hinge regions for DNA recognition, 
nor do they contact sequences outside the core DRI. 

Our crystallographic findings with both NR complexes do not 
support the notion of a ‘common architecture’ for full-length NRs”’. 
Our findings also dispel the view that NR polypeptides are arrays of 
‘domains on a string’, each of which confers its own independent 
function without physical and functional integration. The repertoire 
of quaternary structures in the NR family is likely to be diverse, even 
though both the DBDs and LBDs are conserved. This expectation 
stems from the fact that neither hinge regions nor LBD surface residues 
are conserved in the NR family, yet these features are the key drivers of 
quaternary folding. The multiple response element configurations 
used in the NR family are another driver of quaternary organization. 

Mounting evidence points to the importance of interdomain com- 
munication in the NR family. For oestrogen receptors, the activities of 
ligands are influenced by the response elements, and DNA can also 
influence coactivator binding™. In the glucocorticoid receptor, small 
conformational changes in the DBD propagate across the receptor to 
influence the LBD, and in the androgen receptor there is also evidence 
of DBD-to-LBD communication”*”*. Our findings reveal that PTMs 
can modulate the interdomain connections in the quaternary fold. It 
has been reported that certain PPAR-Y ligands can selectively block the 
phosphorylation of PPAR-y2, indicating communications between 
the LBD pocket and the site of phosphorylation’”**. Ser273 in 
PPAR-y is positioned within a domain-domain junction of the 
PPAR-y-RXR-o complex (Supplementary Fig. 12). From its position, 
the phosphorylation state of Ser 273 can communicate with the PPAR- 
y ligand-binding pocket and with the DNA-reading heads of the 
PPAR-y-RXR-« heterodimer. 

For HNF-4a, small molecules directed at the sensitive interjunctional 
sites may prove to be beneficial for treating MODY1 patients in which 
the DNA-binding properties have been mutationally compromised. To 
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Figure 4 | Comparison the HNF-40, homodimer and the PPAR-y-RXR-a 
heterodimer complexes on DR1 DNA. a, The PPAR-y-RXR-« heterodimer 
on DRI. b, The HNF-4x% homodimer on DRI. c, Their overlap when the DRI 
sequences are superimposed, showing the distinct domain-domain 
arrangements in these two complexes. The two complexes are shown in an 
identical fashion with respect to the DNA sequence facing the viewer. 
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find these molecules, high-throughput screening efforts must target the 
complete architecture of this receptor and not just the isolated LBD. We 
have identified two locations in the quaternary structure of the HNF-40 
complex that appear to be accessible for the binding of small-molecule 
allosteric modulators (Supplementary Fig. 13). An expanded under- 
standing of the physical connectivity between LBDs, DBDs and other 
domains in the NR family should expand and better guide the discovery 
of receptor modulators with therapeutic value. 


METHODS SUMMARY 


The HNF-4« protein segment lacking the AB and F regions, corresponding to 
residues 46-368 (National Center for Biotechnology Information (NCBI) acces- 
sion NM_178849), was expressed in E. coli. The purified protein was then com- 
bined with synthetic duplex DNA (DR1) and coactivator peptide, further purified 
as a complex, and subjected to crystallization trials. X-ray diffraction data were 
collected at the Argonne National Laboratory SBC-CAT 19ID beamline and the 
structure was solved by molecular replacement, using the previously determined 
structures of the HNF-4% LBD and DBD. Biochemical studies measuring DNA 
affinity were measured using wild-type and mutant proteins prepared using 
QuikChange Site-Directed Mutagenesis (Stratagene). The DNA was fluorescein 
labelled at the 5’ end of the top strand, and fluorescence polarization anisotropy 
measurements were used to obtain Kg values. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Expression, purification and crystallization. The HNF-4« proteins used in this 
study reference the NCBI sequence for HNF4, accession NM_178849. All HNF-4a 
(human) constructs, including FL (1-464), AABAF (46-368), AF (1-368), AAB 
(46-464) and DBD (46-126) and others were expressed from pET46 Ek/LIC 
vector in BL21(DE3) E. coli cells (Novagen). The HNF-4% AABAF (human) 
construct was used in crystallization experiments. Cells were induced with 
0.5mM IPTG at 17°C for 16h, and lysed in 20mM Tris (pH 7.5), 500mM 
NaCl, 20 mM imidazole and 10% glycerol. The purification used involved His- 
Bind resin (Novagen), SP-Sepharose column (GE Healthcare) and gel filtration 
(Superdex 200). Purified HNF-40 was then combined with oligonucleotide (DR1) 
in a 1:1.5 molar ratio. The oligonucleotide strands 5'-GGAACTAGGTCAAAGG 
TCAG-3’ and 5’-CCTGACCTTTGACCTAGTTC-3’ were purified and annealed. 
Coactivator peptide (EKHKILHRLLQDSY) was also added in a 3X molar ratio. A 
final gel-filtration step was carried out to remove any excess DNA. Crystals were 
grown with 5-10% PEG 3350, 25 mM MgCl, 25 mM NH,Cl, 10 mM dithiothrei- 
tol (DTT) and 0.1 M MES pH 6.5 at 4°C. 

Data collection and structure determination. Diffraction data was collected at 
the Argonne National Laboratory SBC-CAT 19ID beamline and the structure was 
solved by molecular replacement. X-ray data were collected at a wavelength of 
0.9793 A, at 100 K. The backbone dihedral (Ramachandran) angles for the amino 
acids in the final coordinates conform to preferred, allowed and disallowed stati- 
stics of 90.5%, 8.2% and 1.3%, respectively. The details of X-ray diffraction data 
collection, structure solution and refinement are in Supplementary Table 1. 
Fluorescence polarization assay. All the mutants used in DNA-binding assays 
were prepared using QuikChange Site-Directed Mutagenesis (Stratagene). The 
DNA strands 5’-GGAACTAGGTCAAAGGTCAG-3’ and 5’-CCTGACCTTTG 
ACCTAGTTC-3’ were annealed to make the DR1 for binding studies. For binding 
assays, 2nM fluoresceinated DNA (5'-end conjugation on the top strand) was 
incubated with purified HNF-4« protein for 2h at room temperature (23 °C). 
Protein concentration was varied by serial dilution in binding buffer (20 mM 
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Tris-HCl, pH 7.8, 20mM NaCl, 8% glycerol, 10mM DTT). The fluorescence 
polarization signals were recorded using 96-well black polystyrene plates on 
FlexStation 3. The data were later converted to fluorescence anisotropy values 
and normalized. The Kg of each construct and mutants were calculated by fitting 
the curve in KaleidaGraph 4.1. 

Interactions of the A/B and F domains within the HNF-4a complex. For the 
study shown in Supplementary Fig. 1D, the AB domain (residues 1-45) and F 
domain (369-465) of human HNF-4« were expressed from pET46 Ek/LIC vector 
in BL21(DE3) E. coli cells (Novagen). Each protein was purified using a HisTrap 
column, followed by size-exclusion chromatography on a HiLoad 16/60 Superdex 
200 column. The final samples were prepared in 10 mM phosphate (pH 7.0) and 
100 mM NaCl buffer for fluorescein labelling. Ten micrograms of each peptide, 
HNF-40-AB, HNF-4a-F and PGC-1a% LXXLL-motif peptide (AEEPSLLKKLLAY, 
synthetically made and purchased from AnaSpec) were incubated with fluores- 
cein-5-EX succinimidyl ester at a molar ratio of 1:2 in 100 mM potassium phos- 
phate (pH 7.0) coupling buffer at 37 °C for 60 min, quenched by adding 100 mM 
Tris (pH 8.0) and incubated for 30 min at room temperature. The labelled peptides 
were purified using Sephadex G-15 column for fluorescence polarization measure- 
ments. The HNF-4%AABAF-DNA complex was prepared for the peptide-binding 
assays as described earlier. Five nanomolar of each fluorescein-labelled peptide 
was incubated with HNF-4a-DNA complex of serially diluted concentrations for 
2h at room temperature. Recording of fluorescence polarization signals and data 
processing were conducted as described earlier. 

Transcription reporter assays. For the transcriptional reporter studies shown in 
Supplementary Fig. 11, we used both HEK293T and COS-7 cells that were seeded in 
24-well plates and 1 day later transfected with 400 ng of the pCMV-Tagl-HNF-4a 
(46-368) wild-type or mutant plasmid, 100 ng of apoCIII-pTKLuc reporter (a gift 
from D. Kelly) and 10ng of pRL (control Renilla luciferase) using jetPEI regent 
(Polyplus) according to the manufacturer’s protocol. Luciferase activity was mea- 
sured 48 h after transfection using the Dual-Glo Luciferase Assay System (Promega) 
and data were normalized by the relative ratio of firefly and Renilla luciferase activity. 
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REPRODUCTIVE BIOLOGY 


Breeding 


opportunities 


The reproductive sciences offer an unusual variety of 
career options — but some are more fertile than others. 


BY CHARLOTTE SCHUBERT 


elp to implant a human embryo; 
Hes mouse eggs divide after fer- 
tilization; interview women about 
their experiences with emergency contra- 


ception; collect sperm from finches in the 
Galapagos Islands off Ecuador: specialists in 


reproductive biology undertake these duties 
and many more. The field provides a surfeit 
of career trajectories and research questions. 
“We cover everything from fertilization to 
death,” says Dolores Lamb, director of the 
Center for Reproductive Medicine at Baylor 
College of Medicine in Houston, Texas. 

The reproductive sciences also touch on 


human life and biology at almost every level, 
from molecular and cellular events such as 
the recognition of the egg by sperm, to whole- 
body processes including hormonal regulation 
of puberty and population-level questions such 
as what factors affect teenage-pregnancy rates. 

“Tt’s amazing, the diverse backgrounds that 
all somehow feed into reproductive medicine,’ 
says Lamb. Many researchers study reproduc- 
tion as part of a doctorate in reproductive or 
developmental biology. Others might find 
their way to the field through cell biology 
(focusing on sperm stem cells, for example) or 
animal sciences (perhaps studying cattle hor- 
mones). Attendees at the annual meeting of the 
Society for the Study of Reproduction, which is 
based in Madison, Wisconsin, hail from a wide 
range of disciplines. 

Diversity does not guarantee jobs, however, 
and positions are scarce in many areas of indus- 
try and academia. Richard Sharpe, who heads 
the graduate programme at the UK Medical 
Research Council’s Centre for Reproductive 
Health at the University of Edinburgh, says 
that three years ago he assured students that if 
they excelled, they could find work. “We can no 
longer say that,” he says. Academic opportuni- 
ties have shrunk as the recession has taken a 
bite out of budgets and funding priorities have 
shifted to areas such as chronic disease, says 
Sharpe. (Although location does matter; see 
‘Renminbi for reproductiom) In the reproduc- 
tive sciences, as in most life sciences, the job 
market has constricted, agrees Michael Skinner, 
founder of the Center for Reproductive Biol- 
ogy at Washington State University (WSU) in 
Pullman. 

Even so, he says, studying reproduction will 
give doctoral students an edge with employ- 
ers from fertility clinics to animal-agriculture 
companies. And areas such as global health 
continue to grow. 


LOOKING TO ACADEMIA 

WSU graduates have landed academic jobs in 
everything from toxicology to oncology, says 
Skinner, who notes that much cancer research 
focuses on reproduction-related cancers such 
as breast and prostate. These days, “you have to 
market yourself broadly’, says Tracy Clement, 
who earned her PhD with Skinner and is look- 
ing for an academic post. “It does not feel like a 
good time to be on the job market.” 

If she does get a university job, Clement will 
face funding difficulties. The major source of 
US research money in the field is the National 
Institute of Child Health and Human > 
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> Development in Bethesda, Maryland, but 
last year only 12.5% of grant applications for 
major research projects to the institute were 
successful — less than at many other National 
Institutes of Health (NIH) institutions. 

In the future, says Skinner, academics will 
require support from multiple sources — 
something for which the broad field is particu- 
larly suited. Skinner, for instance, has received 
funding not only from the NIH, but also from 
the Bill & Melinda Gates Foundation in Seattle, 
Washington, to develop a male contraceptive 
and from the US Department of Defense to 
study how exposure to environmental toxi- 
cants affects subsequent generations. He has 
also applied to the John Templeton Founda- 
tion, based in West Conshohocken, Pennsyl- 
vania, to study the role of epigenetics in finch 
evolution in the Galapagos. 


MIXED OUTLOOK IN INDUSTRY 

The pharmaceutical industry has shed jobs in 
contraception over the past 10 years, in part 
because of nervousness about the side effects 
of reproduction-related drugs, which have 
drawn numerous lawsuits. Companies such 
as Wyeth — acquired in 2009 by Pfizer in New 
York — and Bayer, based in Leverkusen, Ger- 
many, have cut back or dropped entire research 
programmes. 

Daniel Johnston, a former principal research 
scientist at Pfizer who was laid off in 2010, 
could not find a job directly related to human 
reproduction at a pharmaceutical or biotech- 
nology company despite more than 5 years of 
pharmaceutical experience in contraception 
and women’ health. But while at Pfizer, John- 
ston had begun to work in oncology — and 
that experience helped him to find work at a 
company focused on liver cancer. 

Johnston's lateral move is not unusual, given 
that reproduction overlaps with many areas 
(Johnston compares a testis to a tumour; both 
have a low-oxygen core of rapidly dividing stem 
cells — but one produces sperm instead of can- 
cer cells). Susan Fisher, director of translational 
research in perinatal biology and medicine at 
the Center for Reproductive Sciences of the 
University of California, San Francisco, says 
that adaptability gives her graduates a leg up 
in industry. They have found jobs at biotech- 
nology companies focused on oncology, stem 
cells and the rapidly expanding area of pre- 
natal genetic testing, a market that could soon 
be worth more than US$1 billion yearly (see 
Nature 486, 454; 2012). Companies competing 
in this area include Ariosa Diagnostics in San 
Jose and Natera in San Carlos, both in Cali- 
fornia. Skinner advises students and postdocs 
interested in an industry job to get training in 
the broadest possible range of lab technologies, 
including genomics. 

Reproductive biologists are also finding 
work in animal agriculture. “T get calls maybe 
two or three times a year from companies 
looking for a master’s- or PhD-level scientist 


to run a lab,” says Derek McLean, a biologist 
who studies cattle, pig and mouse reproduc- 
tion at WSU. 

Scientists who can store and manage ani- 
mal semen and test fertility products for 
female livestock are in demand at companies 
and organizations such as Select Sires, a fed- 
eration of farmer 
cooperatives based 
in Plain City, Ohio, 
that provides live- 
stock-breeding ser- 
vices. Candidates 
for such posts have 
generally trained ina 
reproductive-biology 
laboratory and have a 
degree in animal sci- 
ence. The job mar- 


“You have to ket is steady in the 
prepare f or United States, and is 
option one — but — expanding in Brazil 
have optiontwo and other emerging 
andthreeinthe countries that are 
wings.” moving towards a 


more industrialized 
animal-agriculture 
system, says McLean, who is leaving WSU this 
autumn for Phibro Animal Health in Teaneck, 
New Jersey. 

Genomics opportunities are also emerg- 
ing at companies that are developing tests for 
genetic selection of farm animals; one such 
employer is Illumina in San Diego, Califor- 
nia, which markets a high-density Bovine 
BeadChip, a genetic array that detects traits 
in cattle. 

Fertility clinics are an option for those look- 
ing for something more applied — and perhaps 
more likely to have an immediate impact on 


Tracy Clement 


people's lives. There are about 400 clinics in 
the United States, and they often hire PhD- 
level scientists to manage a staff of bachelor’s- 
and master’s-level technicians involved in 
tasks such as culturing and freezing human 
embryos and performing hormone assays. 
Such scientists often interact with patients 
—a lead embryologist, for instance, might 
contact patients with test results, and assist 
during implantation of the embryo. “Tt brings 
a lot of joy in our profession to help people to 
have a family, says Pierre Miron, who heads a 
fertility clinic near Montreal in Canada. “The 
interaction with patients is a great part of the 
job? Jobs at clinics affiliated with universities 
often provide more opportunities for fertility 
research than private clinics, says Miron. 

The government of Quebec province began 
paying for in vitro fertilization in 2010 as part 
of the national health-care system, leading to 
rising demand for services. Many clinics are 
filling job slots with trainees from Réseau 
Québécois en reproduction (the Quebec 
Reproduction Network), a consortium of 
more than 80 researchers at institutions such 
as McGill University in Montreal. But in most 
of the world, jobs in fertility clinics are com- 
petitive. Strong candidates not only have the 
right human touch, says Miron, but also have 
training in a range of techniques, including 
sperm and egg manipulation. 


GLOBAL IMPACT 

Global health may be a more fruitful area for 
reproductive-biology graduates. “My personal 
feeling is that there is now a stronger market 
for people working at the population level 
than at the molecular level in the reproductive 
sciences,’ says Ward Cates, president emeritus 
of FHI 360, a global-health organization based 


ASIAN OUTLOOK 


Renminbi for reproduction 


Thirteen years after earning his PhD, and 
following two stints as a postdoc, Minghan 
Tong, a reproductive biologist at Washington 
State University in Pullman, has finally 
landed a tenure-track research position. 
Unable to find such a job in the United 
States, Tong is heading to greener pastures 
in his native China. In September, he will 
begin work at the Shanghai Institutes for 
Biological Sciences. Tong and others are 
applying for a 39-million-renminbi (US$6.3- 
million) grant to study the epigenetic 
regulation of sperm production. 

China is a bright spot in the tight market 
for academic jobs in reproductive sciences. 
The sperm-production grant is part of a 
major funding initiative in the field by the 
Chinese government, which is interested 
in new approaches to contraception and in 
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the reproductive impacts of environmental 
problems such as toxicants that may 
damage sperm and eggs by tweaking the 
epigenome to affect multiple generations. 

In 2007, the Chinese Ministry of Science 
and Technology designated development 
and reproductive sciences as one of four 
core research areas, ramping up grant 
funding. This year, the ministry is expected 
to fund four 5-year projects in the field, each 
worth more than 24 million renminbi. 

Other national and local agencies have 
also increased funding, spurring the 
establishment of dozens of research centres 
focused on reproductive sciences and 
creating hundreds of jobs, says Qinghua Shi, 
a professor of life sciences at the University 
of Science and Technology of China in 
Hefei. €.S. 


STEVEN R SHAW/IMAGE ASSOCIATES 


in Durham, North Carolina. He points to 
Family Planning 2020, an ambitious ini- 
tiative to roll out contraceptive services to 
120 million girls and women in develop- 
ing countries by 2020. Donors including 
the Bill & Melinda Gates Foundation and 
governments of both developed and devel- 
oping nations have pledged $2.6 billion to 
the programme, which was launched last 
July at a meeting in London spearheaded by 
Melinda Gates. “This will define the future 
for public-health jobs in the reproductive 
sciences,’ says Cates. 

He adds that the initiative will create 
jobs, mostly in the developing world, for 
researchers who know how to cost-effec- 
tively implement such services and for 
scientists who can evaluate their impact 
— by, for instance, assessing the uptake of 
contraception and its effects on popula- 
tion growth and women’s and children’s 
health. The effort will require research- 
ers with backgrounds in areas such as 
demography, sociology, economics and 
public health. 

Cates says that researchers with a 
basic-science background in reproduc- 
tive sciences and extra training in fields 
such as epidemiology often have a leg up 
when competing for jobs in areas includ- 
ing clinical-trial design, because of their 
understanding of biology. 

Patricia Sadate-Ngatchou earned a PhD 
studying sperm development at WSU. 
But a visit home to Cameroon during a 
major cholera outbreak in 2010 changed 
the course of her career. “How do you help 
people on the ground?” she asked herself. 

Sadate-Ngatchou is now studying for 
a master’s degree in epidemiology at the 
University of Washington in Seattle. Her 
ultimate goal is to move into a decision- 
making position in government or a foun- 
dation involved in reproductive health; 
a suitable post might be as a programme 
officer overseeing grants. However, Sadate- 
Ngatchou thinks that she may first have to 
do entry-level work as an epidemiologist, 
for instance in disease surveillance. 

The variety of questions and opportu- 
nities in reproductive biology keeps some 
researchers hooked on the field, despite the 
tough market. Some end up in niches they 
never expected, such as facilitating panda 
or reptile reproduction in zoos, or assessing 
toxicants for their effects on embryonic and 
pubertal development at government insti- 
tutions such as the US Environmental Pro- 
tection Agency. Clement is open to a variety 
of possibilities. “If you are a reproductive 
biologist,” she says, “you have to prepare 
for option one — but have option two and 
three in the wings.” m 


Charlotte Schubert is a freelance writer 
based in Seattle, Washington. 


TURNING POINT 


Bruno Reversade 


A molecular biologist at the Agency for 
Science, Technology and Research (A*STAR) 
Institute of Medical Biology in Singapore, 
Bruno Reversade is the first scientist based 
outside Europe to win the European Molecular 
Biology Organization Young Investigatorship 
Award, which he collected last November for 
his work on genetics and twinning. 


Do you thrive in a competitive environment? 
Yes. I realized that university would be com- 
petitive when one of my first professors said 
that 60% of the class would not make it to the 
second year. After that, I sat in the front and 
worked hard. 


What led to your fascination with embryonic 
development? 

I went to the University of Western Ontario 
in London, Canada, in my fourth year as an 
undergraduate and worked on early zebrafish 
development. The developing embryo was so 
beautiful and fundamental to life that I realized 
it was a special area. At the time, medicine and 
biology were all about identifying and treating 
disease, but I found a resonance with birth and 
development. I went to the Pasteur Institute in 
Paris for a year to work on early head develop- 
ment after seeing a knockout mouse with no 
head on the cover of Nature (W. Shawlot and 
R.R. Behringer Nature 374, 425-430; 1994). 


What was the biggest challenge of your PhD? 

I did most of my research at the University 
of California, Los Angeles, where I spent the 
first half of my programme chasing chordin, 
a protein that my adviser, Eddy De Robertis, 
and I thought was circulating in the blood. 
After three years we found we were mistaken. 
I persevered, however, and we detailed how 
multiple proteins help embryos that are cut 
in two to self-regulate consistently. Eddy and 
I published the work in Cell (B. Reversade and 
E. M. De Robertis Cell 123, 1147-1160; 2005). 


What led you to focus on genetics in twinning? 

Sitting in the lab cutting frog embryos day after 
day led me to a defining realization. Identical 
twins occur once in every 300 births, more fre- 
quently than most genetic diseases. The dogma 
at the time was that twinning just happens, 
but I started to look for evidence of a genetic 
trigger. Then Hanan Hamamy, a genetic cli- 
nician who at the time was at the Jordanian 
National Center for Diabetes, Endocrinology 
and Genetics in Amman, identified 13 pairs 
of identical twins across multiple generations 
of a single family — hinting at a genetic link. 


She kindly invited me to work with her. 


How were you able to rush out to Jordan? 

I wanted to test my ideas as quickly as possi- 
ble, and I didn’t want to do a postdoc. Perhaps 
that was arrogant or unrealistic, but I wanted 
to be independent. Fortunately, Swiss philan- 
thropist Branco Weiss was seeking young sci- 
entists who were pursuing a biological problem 
with societal impact for the Society in Science 
fellowship. I met Branco and explained that 
identical twins can develop through several 
mechanisms, including embryonic bisection 
and possibly genetics, which for me calls into 
question the moral uproar over cloning. I con- 
vinced him that the idea was worth pursuing 
and he gave me the money. 


Have you published this work? 

Not yet. ’m now working with samples from 
other families with multiple sets of twins. We 
found a gene that is overexpressed in identical 
twins and encodes a protein. We are making 
sure it is well protected by patents. 


What is your most important career move so far? 
In 2008, I was the first A*STAR investigator 
recruited as an assistant professor at the Insti- 
tute of Medical Biology. They offered me carte 
blanche: I have no teaching or grant-writing 
responsibilities. Everything was new and the 
country was investing so much in science. I have 
blossomed here because I got that freedom just 
as the revolution in human genetics began. 


What do you plan to do next? 

I want to work on rare diseases ranging from 
developmental anomalies to inherited cancers. 
If you want to understand a trait in the general 
population, you need to look at the outliers. = 


INTERVIEW BY VIRGINIA GEWIN 
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Ua SCIENCE FICTION 


THE SEPARATISTS 


BY KJ KABZA 


he last thing I remembered was 
Tit Hot chair and the grey CAT-15 

cable jamming into the jack socket 
in my skull. I knew what that meant. 

I rolled over in bed. My new, standard- 
ized body smelled of the stasis tank and 
moved with clumsy naivety. 

“Car bomb,’ said Sara. 

I looked up. She stood at my bedside 
like a coroner, gazing down impassively at 
another piece of meat. Her tone was flat 
and her face indifferent stone. “Outside the 
Smithsonian. The Separatists again.” 

I turned away from her dead eyes. Even 
after all this, her loathing is what kills me. 
I know my continuity is a myth, and ’m 
nothing but an echo. But her eyes never say, 
At least I've got the echo. They say, You are a 
monster — a pre-made flesh lattice, stamped 
with my dead husband’ face. 

“Oh, I said. 

We waited in silence for the doctor. 


In 15 days I was back above ground, dressed 
ina dark grey suit and a red power tie. Smil- 
ing and confident. Tone warm yet brisk. Sara 
at my side, one hand on my arm, smiling, 
always smiling. How long could we go on 
like this? Mr President, how are you feeling? 
Mr President, are you going to reopen talks 
with Alabama? Mr President, how do you 
respond to allegations of terrorism against 
the Confederacy of Texahoma? 

While I was out, I missed the secession 
of South Missouri. The Secretary of Home- 
land Integrity said theyd left nine days ago, 
issuing a statement with the by-now famil- 
iar elements: time to leave a once-great 
nation, fallen into extreme financial and 
social decay, too divided to fight for what 
we needed. Time for something radical. 
“And by the way,’ said the secretary, “South 
Missouri's so-called Prime Minister is agitat- 
ing for war.” 

In times like these, Dr Ramaran had said, 
after dubbing my makeshift immortality 
‘Operation Phoenix’ the President has to 
seem untouchable. 


Sara came to bed one night while I was still 
awake. “Nick.” 

“Yes?” 

Around us, the darkness rustled as she 
burrowed under the blankets. Her voice 
was distant, spoken to the wall at my back 
instead of me. “Before you died last time” 


A radical element. 


“Don't.” 


She paused. I knew Id said the wrong 
thing, but how could I ever say anything else? 

Sara went ahead. “My mother called to 
talk about Lacey. And... youand I sat in the 
garden, for a while” 

She expected me to nod. SoI did. 

“And you complimented the President of 
Transnistria on her perfume.” 

I bit my tongue. She kept talking, placing 
the unimportant moments in a careful row, 
as if the order of their occurrence could give 
them any meaning. Like I always did, I pre- 
tended she was debriefing me on the loose 
ends accumulated since the last back-up, and 
not reminding me, again, in every possible 
way, of what shed lost. 

Her litany trickled into nothing. Empty 
minutes passed. We listened to each other's 
breaths, the only proof of ourselves in the 
darkness. 

Isaid, “I'm sorry.’ 

Colourlessly: “I know.” 

“Ifit were up to me —” 

“Ttismt” 

“If it were up to me, I repeated, “I 
would’ve run away with you after the first 
time. Or maybe I just should've done it any- 
way, consequences be damned.” 

The blankets pulled as she sat up. “You 

mean the first one of 


> NATURE.COM you should’ve” 
Follow Futures: “Sara, what can I 
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husband back.” 

“Goddammit —” 

“You can die,’ she said, “but a copy of 
a sixth-generation copy would just take 
your place. You could divorce me, but the 
Separatists would be all over that. You 
could even blow up that goddam labora- 
tory. They still haven't admitted to mind 
transfers in humans with anyone other 
than you. How long are you planning on 
going on like this? 

“Or you know what? You could murder 
me. You might as well, for what I feel every 
time you —” Her voice broke. “— every 
time another one of you comes into my 
house” 

“Sara —” 

“I could’ve lived with it after the first 
time, Nick. When your first copy was as 
scared as I was, and neither one of us knew 
what it all meant, and he thought he wasn't 
the same person either. But I don't even 
know what you are, now.’ 

“T[know I'm not the same person. But —” 

“Youre not even close, Nick” Only then 
did she start to cry. “You're not close enough 
at all?” 

I got out of bed and left the room, the 
agents outside pretending that they didn’t 
hear her sobs. 


I went into the lab that night, alone. I'd say I 
didn’t know what I was thinking, but that’s a 
lie. I was thinking of Sara. 

The stasis tanks awaited me, where 
hairless, humanoid lattices lay in indefi- 
nite repose. One was already prepped and 
imprinted with my face, and I set that tank 
to drain. 

Ilaid the body ona bed, inserted the CAT- 
15 cable into the socket at the base of the 
skull, and accessed the Hot computer, where 
every back-up of my consciousness resides. 
I found the oldest conglomerate that was 
flagged as ‘utilized’ — the only one Sara had 
said she could live with — and prepared the 
download. 

While I waited, I wrote out a note for him 
on a piece of scrap paper: 

“Run away with Sara. It’s time for some- 
thing radical” = 


KJ Kabza’s work has appeared in F&SF, 
Daily Science Fiction, Beneath Ceaseless 
Skies, AE: The Canadian Science Fiction 
Review and others. Visit him at 
www.kjkabza.com or follow him on 
Twitter @K]Kabza. 


JACEY 


BRIEF COMMUNICATIONS ARISING 


A Late Eocene date for Late Triassic bird tracks 


ARISING FROM R. N. Melchor, S. De Valais & J. F. Genise Nature 417, 936-938 (2002) 


Bird-like tracks from northwest Argentina have been reported as 
being of Late Triassic age’. They were attributed to an unknown group 
of theropods showing some avian characters. However, we believe 
that these tracks are of Late Eocene age on the basis of a new weighted 
mean ~’°Pb/***U date (isotope dilution-thermal ionization mass spec- 
trometry method) on zircons from a tuff bed in the sedimentary 
succession containing the fossil tracks. In consequence, the mentioned 
tracks are assigned to birds and its occurrence matches the known 
fossil record of Aves. 

The redbed sequence of the former Santo Domingo Formation 
yielded several-hundred bird-like footprints, which were assigned 
to Gruipeda dominguensis (the most common ichnotaxon), cf. 
Alaripeda isp., and another taxonomically indeterminate bird-like 
footprint’. The age of the stratigraphic unit was considered to be 
Late Triassic on the basis of known fossil wood remains and geochro- 
nological information from basalt lava flows thought to be inter- 
bedded in this unit'’*. Further geological studies revealed that the 
Santo Domingo Formation contains several thrust sheets of different 
ages, and that the trace-fossil-bearing horizons belong instead to the 
recently proposed Laguna Brava Formation, in a thrust sheet separate 
from the one that contains the dated basalt and fossil wood remains’. 
A 12-cm-thick crystal-rich ash-fall tuff within the thrust sheet with 
the bird-like footprints was sampled for this study for U-Pb zircon 
geochronology (supported by US National Science Foundation grant 
EAR 0931839 and ANPCyT PICT 13286 from Argentina). This tuff 
lies 38 m below the first layer with definite G. dominguensis and 124m 
below the main horizon with hundreds of G. dominguensis°. There is 
no stratigraphic discontinuity between the tuff bed and the footprint- 
bearing levels. Zircon grains were separated from the tuff using con- 
ventional methods and were dated using high-precision chemical 
abrasion-isotope dilution-thermal ionization mass spectrometry 
(CA-ID-TIMS)’*. The tuff yielded abundant clear, long bipyramidal 
(150 jt1m and 250 jim) and sharply faceted zircons. The five youngest 
analyses from a total of nine form a coherent cluster with a weighted- 
mean **°Pb/**°U date of 37.222 + 0.018/0.024/0.047 million years (Myr) 
ago (internal uncertainties/with tracer calibration uncertainties/with 
decay constant uncertainties; mean square of weighted deviation = 1.6) 
(Fig. 1 and Table 1). 


Therefore, we suggest that the maximum age for the bird tracks 
of the Laguna Brava Formation of northwest Argentina (including 
G. dominguensis) is Late Eocene (Bartonian/Priabonian’), and matches 
the known fossil record of birds. Recent studies’? documenting possible 
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Figure 1 | Date distribution plot for analysed zircons from tuff sample 
S$D04-7 from the former Santo Domingo Formation (now considered as 
part of the Laguna Brava Formation). The y axis shows measured 
°°SPb/?U age (see Table 1 for complete analytical data). Data acquisition, 
reduction, error propagation and plotting were done using EARTHTIME 
supported statistical methods’* and the U-Pb_Redux software package’’. Bar 
heights are 2o analytical uncertainties; dark/light grey boxes signify analyses 
included/excluded from age calculation, respectively. The shaded horizontal 
band and its width represent the calculated weighted mean age and its 20 
analytical error. MSWD, mean square of weighted deviation; n, number of 
analyses included in the calculated date. 


Table 1| U-Pb data for analysed zircon from tuff sample of the Laguna Brava Formation 


Composition Ratios Age (Myr ago) 

Sample Phot Pb*t/ Th/ —2°6Pbs/ 208 Pb ||/ 206P hq / Error = °°7Pb§/-—s Error = ®°7Pb§//-_—s Error = ?°®Pb/ Error 207Pb/ 207Pb/ Corr. 
fractionst (pg) Pg U 204Pb 206Ph 238y (20%) 235\ (20%) 206Pb (20%) 238y (20) 235y 2°6Ph coef 
SD04-7 

z2 0.8 78 090 441.2 0.287 0.005821 (0.19) 0.03811 (2.31) 0.04750 (2.27) 37417 0.071 37.98 74 0.23 
z3 14 21.1 0.74 1,211.7 0.238 0.005804 (0.15) 0.03765 (0.83) 0.04707 (0.81) 37.305 0.054 37.53 52 0.26 
z4 0.7 218 0.65 1,281.5 0.207 0.005785 (0.12) 0.03728 (0.90) 0.04676 (0.87) 37.186 0.043 37.16 36 0.32 
z5 05 48.2 0.67 2,792.1 0.214 0.005791 (0.10) 0.03753 (0.39) 0.04702 (0.36) 37.221 0.037 37.41 494 0.33 
z7 05 22.1 0.62 1,305.7 0.198 0.005802 (0.15) 0.03752 (0.82) 0.04692 (0.80) 37.295 0.054 37.40 44 0.27 
z8 13 11.2 059 4673.2 0.190 0.005792 (0.13) 0.03747 (1.52) 0.04694 (1.49) 37.228 0.047 37.35 45 0.27 
z9 0.7 265 0.62 1,561.8 0.199 0.005811 (0.15) 0.03737 (0.72) 0.04666 (0.69) 37.350 0.057 37.25 31 0.32 
z10 16 13.7 0.73 791.2 0.234 0.005785 (0.18) 0.03727 (1.83) 0.04674 (1.73) 37.187 0.066 37.15 35 0.60 
zi11 05 466 0.59 2,756.2 0.189 0.005794 (0.08) 0.03728 (0.44) 0.04669 (0.41) 37.244 0.029 37.17 32.4 0.43 


Corr. coef., correlation coefficient. Pb* is radiogenic Pb concentration. Age calculations are based on the decay constants of ref. 11. 
+All analyses are single zircon grains and pre-treated by the thermal annealing and acid leaching (CA-TIMS) technique. Data used in age calculations are in bold. 


$Pb, is total common Pb in analysis. 
§Measured ratio corrected for spike and fractionation only. 
\|Radiogenic Pb ratio. 


Corrected for fractionation, spike, blank and initial Th/U disequilibrium in magma. All common Pb is assumed to be blank. Total procedural blank was less than 0.1 pg for U. Blank isotopic composition: 


206Ph/24 Ph = 18.42 + 0.35; 2°7Pb/2 Pb = 15.36 + 0.23; 2°SPb/?Pbh = 37.46 + 0.74. 
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landing trace fossils and probing marks closely associated with 
G. dominguensis strongly suggested a younger age for the host sedi- 
mentary succession. Re-definition of stratigraphic units in this region 
and further supporting geological and palaeomagnetical evidence have 
been published elsewhere’. 
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Updated structure of Drosophila 


cryptochrome 


ARISING FROM B. D. Zoltowski et al. Nature 480, 396-399 (2011) 


Recently, we determined the X-ray crystal structure of full-length 
cryptochrome from Drosophila’. Here we report an improved model 
of the Drosophila cryptochrome (dCRY) structure that corrects errors 
in the original coordinates (Protein Data Bank (PDB) accession 3TVS). 
Further refinement of the structure, with automated rebuilding algo- 
rithms in Phenix’ followed by manual building, indicated that a model 
of dCRY could be produced with excellent refinement statistics with- 
out taking into account the non-merohedral twinning originally 
reported (Table 1). 

The rebuilt structure has a root mean squared deviation (r.m.s.d.) 
on Cz positions of 2.4 A compared to the deposited coordinates, with 
most differences found in the conformation of surface loops (Fig. 1). 
However, the new analysis also indicates that the sequence register of 
the carboxy-terminal tail (CTT) helix is displaced by two residues 
(Fig. 2). This change in sequence register offsets the invariant FFW 
motif along the helix axis such that Phe534, and not Trp 536, 
approaches closest to the flavin ring (Fig. 3). In the new model, the 
three residues composing the FFW motif continue to make extensive 
interactions with the photolyase homology domain. This new posi- 
tion of the FFW motif is more consistent with the cellular data of 
Supplementary Fig. 6 of ref. 1, which shows that substitution of FFW 
to three alanine residues has a marked effect on dCRY stability, but 
that the Trp536Ala substitution alone does not. The configuration of 
the flavin centre is similar between the old and new models, with 
the largest difference in the angle of the ribityl-to-flavin (N10) bond 
(Fig. 4). Phosphorylation of Thr518 is not apparent in the new 


Table 1| Data collection and phasing statistics 


Native 
Data collection 0.97918 
Space group P2, 
Cell dimensions 
a, b, c (A) 72.8, 112.3, 75.0 
a, B,y 90.0, 114.9, 90.0 
Resolution (A) 30-2.30 (2.38-2.30)* 
Rsym 0.126 (0.354) 
l/c (I) 11.9 (1.8) 
Completeness (%) 90.1 (57.3) 
Redundancy 3.9 
Refinement | 3TVS (detwinned/native) Rebuilt (detwinned/native) 
Resolution (A) 30-2.3 (2.30-2.38)* 30-2.3 A (2.30-2.38)* 
umber of reflections 47,507 47,507 
Rwork (%) 25.3 (30.0)/34.2 (43.9) 22.2 (24.4)/18.4 (24.2) 
Riree (%) 29.8 (35.8)/40.5 (49.3) 26.0 (34.2)/24.7 (32.1) 
Protein 8,668 8,653 
Solvent 301 321 
Ligand (FAD/Mg) 06/2 06/2 
B-factors (A?) 
Protein 38.1 32.2 
Ligand (FAD/Mg) 0.0 8.1 
Water 28.6 30.7 
r.m.s.d. 
Bond lengths (A) 0.01 0.008 
Bond angles (°) 8 al, 


Revised data collection and Phenix refinement statistics (corrected Supplementary Table 2 from ref. 1). 
* Highest resolution bin for compiling statistics. 


3TVS 
8TVS rebuilt 


Figure 1 | Superposition of Ca traces for 3TVS and rebuilt dCRY structure. 
3TVS is in yellow, and the rebuilt dCRY structure is in cyan. Both subunits 
within the asymmetric unit are depicted. Flavin cofactors are show in 

orange. 


electron density maps despite identification of this modification by 
mass spectrometry. 

The errors in the original structure stemmed from model bias 
introduced during the detwinning procedure. Lower resolution data 
sets to which the original dCRY structure was built appear to suffer 
more from twinning than the 2.3 A resolution data that the final 
model was refined against. Although the high-resolution data does 
contain indications of non-merohedral twinning, including intensity 
oscillations along the reciprocal space | axis and spurious Patterson 
peaks, a model that agrees well with the diffraction data as collected 
can be produced without compensation for these effects (Table 1). 


3TVS 


3TVS rebuilt Glu 530 Te 


Glu 528 


2 
Phe 534 Gin 533 884 


Phe 535 
Arg 532 \ 


Gln 533 


Trp 536 Phe 534 


Phe 534 


Trp 536 


Figure 2 | Sequence shift in the CTT. Left, superposition of CTT from 3TVS 
(yellow) and rebuilt structure (pink). Sequence register in CTT of rebuilt 
structure is displaced two residues towards the C terminus compared to 
3TVS. Right, electron density for the CTT using phases derived from 

the new model. The Fou; — Foaie omit map electron density is contoured 

at 30. 
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Figure 3 | Flavin centre of dCRY. The corrected Fig. 3c from ref. 1 is shown. 
Superposition of key residues in dCRY flavin centre between 3TVS (yellow) and 
rebuilt structure (pink). Position of Trp 536 and linkage between flavin ring and 
ribityl chain differs in rebuilt structure compared to 3TVS. 


The new atomic coordinates have been deposited in the Protein Data 
Bank as 4GU5. 
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Figure 4 | dCRY Trp triad photoreduction pathway. The corrected Fig. 4a 
from ref. 1 is shown. Arrangement of CRY Trp residues in 3TVS (yellow) and 
rebuilt structure (pink). Trp 536 is replaced by Phe 534 in the rebuilt structure. 
Trp 536 is now separated from the flavin ring by the imidazole side chain of 
His 378 (see Fig. 3). Double asterisks denote that the new positions of Trp 394 
and Trp 420 are nearly identical to the old. 
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